Nucleic acid encoding TNFx protein

ABSTRACT

Nucleic acids encoding mammalian, e.g., primate or rodent, genes, purified proteins and fragments thereof. Antibodies, both polyclonal and monoclonal, are also provided. Methods of using the compositions for both diagnostic and therapeutic utilities are provided.

This application claims benefit of U.S. provisional patent application No. 60/231,267, filed Sep. 8, 2000.

FIELD OF THE INVENTION

The present invention relates to compositions and methods for affecting mammalian physiology, including morphogenesis or immune system function. In particular, it provides nucleic acids, proteins, and antibodies which regulate development and/or the immune system. Diagnostic and therapeutic uses of these materials are also disclosed.

BACKGROUND OF THE INVENTION

Recombinant DNA technology refers generally to techniques of integrating genetic information from a donor source into vectors for subsequent processing, such as through introduction into a host, whereby the transferred genetic information is copied and/or expressed in the new environment. Commonly, the genetic information exists in the form of complementary DNA (cDNA) derived from messenger RNA (mRNA) coding for a desired protein product. The carrier is frequently a plasmid having the capacity to incorporate cDNA for later replication in a host and, in some cases, actually to control expression of the cDNA and thereby direct synthesis of the encoded product in the host. See, e.g., Sambrook, et al. (1989) Molecular Cloning: A Laboratory Manual, (2d ed.), vols. 1-3, CSH Press, NY.

For some time, it has been known that the mammalian immune response is based on a series of complex cellular interactions, called the “immune network”. Recent research has provided new insights into the inner workings of this network. While it remains clear that much of the immune response does, in fact, revolve around the network-like interactions of lymphocytes, macrophages, granulocytes, and other cells, immunologists now generally hold the opinion that soluble proteins, known as lymphokines, cytokines, or monokines, play critical roles in controlling these cellular interactions. The interferons are generally considered to be members of the cytokine family. Thus, there is considerable interest in the isolation, characterization, and mechanisms of action of cell modulatory factors, an understanding of which will lead to significant advancements in the diagnosis and therapy of numerous medical abnormalities, e.g., immune system disorders.

Lymphokines apparently mediate cellular activities in a variety of ways. See, e.g., Paul (ed. 1998) Fundamental Immunology 4th ed., Lippincott; and Thomson (ed. 1998) The Cytokine Handbook 3d ed., Academic Press, San Diego. They have been shown to support the proliferation, growth, and/or differentiation of pluripotential hematopoietic stem cells into vast numbers of progenitors comprising diverse cellular lineages which make up a complex immune system. Proper and balanced interactions between the cellular components are necessary for a healthy immune response. The different cellular lineages often respond in a different manner when lymphokines are administered in conjunction with other agents.

Cell lineages especially important to the immune response include two classes of lymphocytes: B-cells, which can produce and secrete immunoglobulins (proteins with the capability of recognizing and binding to foreign matter to effect its removal), and T-cells of various subsets that secrete lymphokines and induce or suppress the B-cells and various other cells (including other T-cells) making up the immune network. These lymphocytes interact with many other cell types.

One means to modulate the effect of a cytokine upon binding to its receptor, and therefore potentially useful in treating inappropriate immune responses, e.g., autoimmune, inflammation, sepsis, and cancer situations, is to inhibit the receptor signal transduction. In order to characterize the structural properties of a cytokine receptor in greater detail and to understand the mechanism of action at the molecular level, purified receptor will be very useful. The receptors provided herein, by comparison to other receptors or by combining structural components, will provide further understanding of signal transduction induced by ligand binding.

An isolated receptor gene should provide means to generate an economical source of the receptor, allow expression of more receptors on a cell leading to increased assay sensitivity, promote characterization of various receptor subtypes and variants, and allow correlation of activity with receptor structures. Moreover, fragments of the receptor may be useful as agonists or antagonists of ligand binding. See, e.g., Harada, et al. (1992) J. Biol. Chem. 267:22752-22758. Often, there are at least two critical subunits in the functional receptor. See, e.g., Gonda and D'Andrea (1997) Blood 89:355-369; Presky, et al. (1996) Proc. Nat'l Acad. Sci. USA 93:14002-14007; Drachman and Kaushansky (1995) Curr. Opin. Hematol. 2:22-28; Theze (1994) Eur. Cytokine Netw. 5:353-368; and Lemmon and Schlessinger (1994) Trends Biochem. Sci. 19:459-463. Other receptor types, e.g., TLR-like, will similarly be useful.

Likewise, identification of novel ligands will be useful. Members of the tumor necrosis factor (TNF) family and transforming growth factor (TGF) family of ligands have identified physiological effects.

Finally, genes which exhibit disease associated expression patterns will be useful in diagnostic or other uses. The molecular diagnostic utility may be applied to identify patients who will be responsive to particular therapies, or to predict responsiveness to treatment.

From the foregoing, it is evident that the discovery and development of new soluble proteins and their receptors, including ones similar to lymphokines, should contribute to new therapies for a wide range of degenerative or abnormal conditions which directly or indirectly involve development, differentiation, or function, e.g., of the immune system and/or hematopoietic cells. Moreover, novel markers will be useful in molecular diagnosis or therapeutic methods. In particular, the discovery and understanding of novel receptors or lymphokine-like molecules which enhance or potentiate the beneficial activities of other lymphokines would be highly advantageous. The present invention provides these and related compounds, and methods for their use.

BRIEF DESCRIPTION OF THE DRAWINGS

FIGS. 1A-1C show a sequence alignment of related IFN receptor family members. Tissue Factor is SEQ ID NO: 4; DISR4 is SEQ ID NO:2; hIFNabR is SEQ ID NO: 5; CRF2-4 is SEQ ID NO: 6; cytor x is SEQ ID NO: 7; and cytor7 is SEQ ID NO: 8.

FIG. 2 shows an alignment of TNF-x and TNF-y polypeptides (SEQ ID NO:9, 11, and 13); p is primate, r is rodent.

FIGS. 3A-3F show an alignment of primate TLR-like protein sequences. TLRL1_HU is SEQ ID NO:15; TLRL2_HU is SEQ ID NO:19; TLRL4_HU is SEQ ID NO:25; TLRL3_HU IS SEQ ID NO:23; and, TLRL5_HU is SEQ ID NO:27;

FIG. 4 shows an Alignment of primate and rodent 5685C6 polypeptide sequences. r8685C6 is SEQ ID NO:33 and p5685C6 is SEQ ID NO:31.

FIG. 5 shows an alignment of Claudin homologs: D2 (SEQ ID NO:34); D8 (SEQ ID NO:36); D17 (SEQ ID NO:39); D7.2 (SEQ ID NO:41).

FIGS. 6A-6F show an alignment of Schlafen homologs: schlafen B (SEQ ID NO:43); schlafen C (SEQ ID NO:45); schlafen D (SEQ ID NO:47); schlafen E (SEQ ID NO:49); and schlafen F (SEQ ID NO:51).

SUMMARY OF THE INVENTION

The present invention is directed to novel genes, e.g., primate embodiments. These genes include receptors related to cytokine receptors, e.g., cytokine receptor like molecular structures, designated DNAX Interferon-like Receptor Subunit 4 (DIRS4); TNF related cytokines designated TNFx and TNFy; Toll-like receptor like molecules designated TLR-L1, TLR-L2, TLR-L3, TLR-L4, and TLR-L5; a TGF related molecule designated TGFx; a soluble Th2 cell produced entity designated 5685C6; a group of genes related to ones whose expression patterns correlate with medical conditions designated claudins, herein referred to as claudins D2, D8, D17, and D7.2; and a second group of genes related to ones whose expression patterns correlate with medical conditions designated schlafens, herein referred to as schlafens B, C, D, E, and F.

In particular, the present invention provides a composition of matter selected from: a substantially pure or recombinant polypeptide comprising at least three distinct nonoverlapping segments of at least four amino acids identical to segments of: SEQ ID NO: 2 (DIRS4); SEQ ID NO: 9, 11, 13, or 53 (TNFx or TNFy); SEQ ID NO: 15, 17, 19, 21, 23, 25, or 27 (TLR-L1 through TLR-L5); SEQ ID NO: 29 (TGFx): SEQ ID NO: 31 or 33 (5685C6); SEQ ID NO: 35, 37, 39, or 41 (claudins); SEQ ID NO: 43, 45, 47, 49, or 51 (schlafens). In preferred embodiments, the distinct nonoverlapping segments of identity: include one of at least eight amino acids; include one of at least four amino acids and a second of at least five amino acids; include at least three segments of at least four, five, and six amino acids; or include one of at least twelve amino acids. In certain embodiments, the polypeptide: is unglycosylated; is from a primate, such as a human; comprises at least contiguous seventeen amino acids of the SEQ ID NO; exhibits at least four nonoverlapping segments of at least seven amino acids of the SEQ ID NO; has a length at least about 30 amino acids; has a molecular weight of at least 30 kD with natural glycosylation; is a synthetic polypeptide; is attached to a solid substrate; is conjugated to another chemical moiety; or comprises a detection or purification tag, including a FLAG, His6, or Ig sequence. In other embodiments, the composition comprises: a substantially pure polypeptide; a sterile polypeptide; or the polypeptide and a carrier, wherein the carrier is: an aqueous compound, including water, saline, and/or buffer; and/or formulated for oral, rectal, nasal, topical, or parenteral administration.

Kit embodiments include those comprising such a polypeptide, and: a compartment comprising the polypeptide; or instructions for use or disposal of reagents in the kit.

Binding compound embodiments include those comprising an antigen binding site from an antibody, which specifically binds to a described polypeptide, wherein: the binding compound is in a container; the polypeptide is from a human; the binding compound is an Fv, Fab, or Fab2 fragment; the binding compound is conjugated to another chemical moiety; or the antibody: is raised to a recombinant polypeptide; is raised to a purified polypeptide; is immunoselected; is a polyclonal antibody; binds to a denatured antigen; exhibits a Kd to antigen of at least 30 _M; is attached to a solid substrate, including a bead or plastic membrane; is in a sterile composition; or is detectably labeled, including a radioactive or fluorescent label.

Kit embodiments include those comprising such a binding compound, and: a compartment comprising the binding compound; or instructions for use or disposal of reagents in the kit.

Methods are provided, e.g., for producing an antigen:antibody complex, comprising contacting under appropriate conditions a primate polypeptide with such a described antibody, thereby allowing the complex to form. Also provided are methods of producing an antigen:antibody complex, comprising contacting under appropriate conditions a polypeptide with an antibody which binds thereto, thereby allowing the complex to form. And methods are provided to produce a binding compound comprising: immunizing an immune system with a polypeptide described; introducing a nucleic acid encoding the described polypeptide to a cell under conditions leading to an immune response, thereby producing said binding compound; or selecting for a phage display library for those phage which bind to the desired polypeptide.

Further compositions are provided, e.g., comprising: a sterile binding compound, or the binding compound and a carrier, wherein the carrier is: an aqueous compound, including water, saline, and/or buffer; and/or formulated for oral, rectal, nasal, topical, or parenteral administration.

Nucleic acid embodiments are provided, e.g., an isolated or recombinant nucleic acid encoding a polypeptide described, wherein the: polypeptide is from a primate; or the nucleic acid: encodes an antigenic polypeptide; encodes a plurality of antigenic polypeptide sequences of SEQ ID NO:2, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, or 53; exhibits identity over at least thirteen nucleotides to a natural cDNA encoding the segment; is an expression vector; further comprises an origin of replication; is from a natural source; comprises a detectable label; comprises synthetic nucleotide sequence; is less than 6 kb, preferably less than 3 kb; is a hybridization probe for a gene encoding the polypeptide; or is a PCR primer, PCR product, or mutagenesis primer.

Various embodiments also include cells comprising the recombinant nucleic acids, particularly wherein the cell is: a prokaryotic cell; a eukaryotic cell; a bacterial cell; a yeast cell; an insect cell; a mammalian cell; a mouse cell; a primate cell; or a human cell.

Kit embodiments include those comprising a described nucleic acid, and: a compartment comprising the nucleic acid; a compartment further comprising a primate polypeptide; or instructions for use or disposal of reagents in the kit.

Other nucleic acids are provided which: hybridize under wash conditions of 30 minutes at 37° C. and less than 2M salt to the coding portion of SEQ ID NO: 1, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50 or 52; or exhibit identity over a stretch of at least about 30 nucleotides to a SEQ ID NO: 1, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, or 52. Preferably, the wash conditions are at 45° C. and/or 500 mM salt, or at 55° C. and/or 150 mM salt; or the stretch is at least 55 or 75 nucleotides.

Methods are provided, e.g., for making: a duplex nucleic acid comprising contacting: a described nucleic acid with a complementary nucleic acid, under appropriate conditions, thereby resulting in hybridization to form the complex; or a nucleic acid complementary to a described nucleic acid with its complementary nucleic acid, under appropriate conditions, thereby resulting in hybridization to form the complex; or a polypeptide comprising culturing a cell comprising a described nucleic acid under conditions resulting in expression of the nucleic acid.

And methods are provided to: modulate physiology or development of a cell comprising contacting the cell with a polypeptide comprising SEQ ID NO: 9, 11, 13, 29, 31, or 33; modulate physiology or development of a cell comprising contacting the cell with a binding compound which binds to SEQ ID NO: 9, 11, 13, 29, 31, 33 or 53, thereby blocking signaling mediated by a protein comprising the SEQ ID NO; label a cell comprising contacting the cell with a binding compound which binds to SEQ ID NO: 15, 17, 19, 21, 13, 15, or 37; or diagnose a medical condition comprising a step of evaluating expression of nucleic acid comprising SEQ ID NO: 34, 36, 38, 40, 42, 44, 46, 48, or 50.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

I. General

The present invention provides the amino acid sequences and nucleic acid sequences of mammalian, herein primate, genes. Among them is an interferon receptor-like subunit molecule, one designated DNAX Interferon Receptor family Subunit 4 (DIRS4), having particular defined properties, both structural and biological. Others include molecules designated TNFx and TNFy; Toll like receptor like molecules TLR-L1, TLR-L2, TLR-L3, TLR-L4, and TLR-L5; TGFx; 5685C6; claudins D2, D8, D17, and D7.2; and schlafens B, C, D, E, and F. Various cDNAs encoding these molecules were obtained from primate, e.g., human, cDNA sequence libraries. Other primate or other mammalian counterparts would also be desired. In certain cases, alternative splice variants should be available.

Some of the standard methods applicable are described or referenced, e.g., in Maniatis, et al. (1982) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor Press; Sambrook, et al. (1989) Molecular Cloning: A Laboratory Manual, (2d ed.), vols. 1-3, CSH Press, NY; Ausubel, et al., Biology, Greene Publishing Associates, Brooklyn, N.Y.; or Ausubel, et al. (1987 and periodic supplements) Current Protocols in Molecular Biology, Greene/Wiley, New York; each of which is incorporated herein by reference.

A nucleotide and corresponding amino acid sequence for a primate, e.g., human DIRS4 coding segment is shown in SEQ ID NO: 1 and 2, respectively. The new DIRS4 lacks a transmembrane segment, which suggests that the subunit acts as a soluble subunit, and would thus be an alpha receptor subunit. Alternatively, or in addition, a splice variant would exist which contains a transmembrane segment. This is consistent with the observation that two transcripts are found in many cell types. Interferon receptor like subunits may be receptors for the IL-10 family of ligands, e.g., IL-10, AK155, IL-19, IL-20/mda-7, AK155, IL-D110, IL-D210, etc. See, e.g., Derwent patent sequence database.

Also provided are nucleotide (SEQ ID NO: 8, 10, 12, and 52) and corresponding amino acid sequences (SEQ ID NO: 9, 11, 13, and 53) for primate and rodent forms of TNFx and primate and rodent forms of TNFy. Features for primate TNFx include: cAMP PKsites about 38, 74, 79, 205; Cas Phos sites about 41, 61; Cyt_c-Mesite about 43; Histone-Me site about 35; Myristoly sites about 5, 57, 220, 232 N-GLYCOSYL site about 229; PHOS2 sites about 38-41, 79-82, 134-136; PKC ph sites about 77, 142. Also segments 119-250, and 209-221 are notable. For rodent TNFx, features include: A predicted signal 1-19; mature would begin at about 20. Other features: cAMP PK sites at about 34, 93, 132, 229, 248, 263; Cas Phos sites about 119, 232, 251; Cyt_c-Me sites about 26, 90, 172; Histone-Me site about 82; Myristoly sites around 278, 290, 303; N-GLYCOSYL: 3 sites about 39, 287, 297; PHOS2 sites about 26-29, 34-37, 90-92, 93-96, 138-140, 192-194, 248-251; and PKC ph sites about 43, 51, 80, 81, 152; TyKinsite about 154. Signal cleavage site predicted between pos. 19 and 20: AGA-GA. Other significant segments include from about 74-132, 94-118, 168-308, and 193-201.

Nucleotide and corresponding amino acid sequences for TLR-L1 through TLR-L5 are provided in SEQ ID NO:14-27. The EST distribution for TLR1 suggests mRNA expression is restricted to brain tissue; chromosome Xq27.1-28 coding region is on a single exon. Features for primate TLR1 (SEQ ID NO:15) include: Tyr Kin site about 704 (KEGDPVAY); Tyr Kin sites about 713 (RNLQEFSY), 825 (KPQSEPDY); N-GLYCOSYL sites about 84 (NYS), 219 (NCT), 294 (NPT), 366 (NIS), 421 (NLT), 583 (NLS); likely a Type Ia membrane protein; a possible uncleavable N-term signal sequence; and a transmembrane prediction of about 618-634 <612-646>. For rodent TLR-L1 (SEQ ID NO:17), the features include: A predicted transmembrane segment from about residues 56-75; and predicted TyKin sites at about residues 136 and 145.

For primate TLR-L2 (SEQ ID NO:19) features include: N-glycosyl sites about 82 (NYT), 217 (NCS), 623 (NST), 674 (NQS); TyKin sites about 889 (RLREPVLY), 450 (RLSPELFY), 917 (KLNVEPDY); TyKin site about 889 (RLREPVLY), 917 (KLNVEPDY). Structurally this molecule has homology to type Ia membrane proteins.

Primate TLR-L3 (SEQ ID NO:23) has the following features: SIGNAL 1-26; TRANS14-34; Pfam:LRRNT 43-73; Pfam:LRR 78-101; LRR_TYP 100-123; Pfam:LRR 102-125; LRR_TYP 124-147; Pfam:LRR 126-149; LRR_TYP 148-171; Pfam:LRR 150-173; LRR_TYP 172-195; LRR_PS172-194; Pfam:LRR 174-197; LRR_TYP 196-219; LRRCT 232-282; Pfam:LRRCT 232-282 with SEG 331-349 or SEG 365-379; Pfam:LRRNT 372-405; LRRNT 372-410; Pfam:LRR 409-432; LRR_TYP 431-454; Pfam:LRR 433-456; LRR_PS 455-477; LRR_TYP 455-478; Pfam:LRR 457-480; LRR_TYP 479-502; Pfam:LRR 481-504 with SEG 502-519; LRR_TYP 503-526; LRR_PS 503-525; Pfam:LRR 505-528; Pfam:LRRCT 562-612; LRRCT 562-612; TRANS 653-673; SEG 653-676; SEG 712-723; SEG 760-776; SEG 831-855. Structurally this molecule has homology to type Ia membrane proteins.

Primate TLR-L4 (SEQ ID NO:25) EST distributions suggest mRNA expression is restricted to brain tissue; human chromosome Xq26.3-28; predicted features at about, e.g., SIGNAL 1-18; SEG 22-38; Pfam:LRR 60-83; LRR_TYP 82-105; Pfam:LRR 84-107; LRR_PS106-128; LRR_TYP 106-129; Pfam:LRR 108-131; LRR_TYP 130-153; Pfam:LRR 132-155; LRR_SD22 154-174; LRR_PS154-176; LRR_TYP 154-177; Pfam:LRR 156-178; LRR_SD22 177-198; LRR_PS177-198; LRR_TYP 178-201; Pfam:LRR 179-200; Pfam:LRRCT 213-263; LRRCT 213-263; LRRNT 341-379; Pfam:LRRNT 341-374; Pfam:LRR 378-401; LRR_TYP 400-423; LRR_SD22 400-421; Pfam:LRR 402-425; LRR_TYP 424-447; LRR_SD22 424-450; LRR_PS 424-447; Pfam:LRR 426-449; LRR_TYP 448-471; LRR_PS 448-470; Pfam:LRR 450-473; LRR_TYP 472-495; LRR_PS 472-494; Pfam:LRR 474-497; SEG 474-488; LRRCT 531-581; Pfam:LRRCT 531-581; SEG 617-643; TRANS 623-643; N-GLYCOSYL sites about 81 (NFS), 216 (NCS), 308 (NPS), 325 (NLS), 423 (NLT); chromosome Xq26.3-28; coding region is on a single exon. Structurally this molecule appears to be a Type Ia membrane protein.

For primate TLR-L5 (SEQ ID NO:27) the entire coding region lies on a single exon on human chromosome 13; predicted features at about, e.g., SIGNAL 1-20; Pfam:LRR 65-88; LRR_TYP 87-110; Pfam:LRR 89-112; LRR_TYP 111-134; Pfam:LRR 113-136; LRR_PS 135-157; LRR_SD22 135-156; LRR_TYP 135-158; Pfam:LRR 137-160; LRR_TYP 159-182; LRR_SD22 159-177; LRR_PS159-181; Pfam:LRR 161-184; LRR_SD22 182-203; LRR_TYP 185-206; Pfam:LRR 185-205; LRRCT 218-268; Pfam:LRRCT 218-268; Hybrid:LRRNT 328-364; Pfam:LRRNT 328-360; LRR_SD22 386-407; Pfam:LRR 388-411; LRR_TYP 389-409; LRR_PS 410-432; LRR_TYP 410-433; LRR_SD22 410-428; Pfam:LRR 412-435; LRR_SD22 434-453; LRR_PS 434-457; LRRTYP 434-457; Pfam:LRR 436-459; SEG 436-445; LRR_PS 458-480; LRR_SD22 458-484; LRR_TYP 458-481; SEG 459-476; Pfam:LRR 460-483; SEG 503-516; LRRCT 517-567; Pfam:LRRCT 517-567; SEG 585-596; TRANS 607-627; SEG 701-710; N-GLYCOSYL 3 sites about 292 (NDS), 409 (NLT), 572 (NPS); TyKin site about 798 (KLMETLMY).

Nucleotide and corresponding amino acid sequences for a primate, e.g., human, TGFx coding segment, are represented by SEQ ID NO:28 and 29, respectively. Human TGFx maps to chromosome 5 (clone CITB-H1_(—)2319M24). Predicted features (SEQ ID NO: 29) include: TGFB domain 115-212; Pfam:TGF-beta 115-167; Pfam:TGF-beta 205-212; TGF-beta like conserved Cys residues at positions 115, 144, 148, 177, 209, 211.

Nucleotide and corresponding amino acid sequences for 5685C6 coding segments are presented in SEQ ID NO:30-33. The primate clone maps to chromosome 21q22.1. Features of primate 5685C6 (SEQ ID NO:31) include: N-GLYCOSYL sites about 10 (NST), 23 (NCS), 76 (NFT), 169 (NVT), 191 (NKS); most likely cleavage site predicted between pos. 19 and 20: VFA-LN. The secreted protein produced by Th2 cells. The corresponding rodent polypeptide (SEQ ID NO:33) has the following features Predicted features: N-GLYCOSYL sites about 6 (NNT), 19 (NCS), 159 (NRS); most likely cleavage site between pos. 26 and 27: TKA-QN. 5685C6 molecules appear to be soluble entities which are expressed in Th2 clones. The entities are useful markers of Th2 cells, and will be useful in characterizing such cell types.

Nucleotide and corresponding amino acid sequences for claudins D2, D8, D17, and D7.2 are SEQ ID NO:34-41 (See, e.g., Simon, et al. (1999) Science 285:103-106).

Nucleotide and corresponding amino acid sequences for schlafens B, C, D, E, and F (see, e.g., see Schwarz, et al. (1998) Immunity 9:657-668) are SEQ ID NO:42-51.

As used herein, the term DIRS4 shall be used to describe a protein comprising a protein or peptide segment having or sharing the amino acid sequence shown in the SEQ ID NOs noted above, or a substantial fragment thereof. The invention also includes a protein variation of the respective DIRS4 allele whose sequence is provided, e.g., a mutein or soluble extracellular construct. Typically, such agonists or antagonists will exhibit less than about 10% sequence differences, and thus will often have between 1- and 11-fold substitutions, e.g., 2-, 3-, 5-, 7-fold, and others. It also encompasses allelic and other variants, e.g., natural polymorphic, of the protein described. Typically, it will bind to its corresponding biological ligand, perhaps in a dimerized state with a second receptor subunit, with high affinity, e.g., at least about 100 nM, usually better than about 30 nM, preferably better than about 10 nM, and more preferably at better than about 3 nM. The term shall also be used herein to refer to related naturally occurring forms, e.g., alleles, polymorphic variants, and metabolic variants of the mammalian protein.

Likewise, reference to the other genes described herein will be made. General descriptions directed to the methods of making or structural features will often be applicable to the other entities provided herein, e.g., the TNFx, TNFy, TLR-L1, TLR-L2, TLR-L3, TLR-L4, TLR-L5, TGFx, 5685C6, claudins D2, D8, D17, D7.2, and schlafens B, C, D, E, and F. Antibodies thereto, nucleic acids encoding them, etc., will be similarly applicable to the different entities.

This invention also encompasses proteins or peptides having substantial amino acid sequence identity with the amino acid sequences. It will include sequence variants with relatively few substitutions, e.g., preferably less than about 3-5.

A substantial polypeptide “fragment”, or “segment”, is a stretch of amino acid residues of at least about 8 amino acids, generally at least 10 amino acids, more generally at least 12 amino acids, often at least 14 amino acids, more often at least 16 amino acids, typically at least 18 amino acids, more typically at least 20 amino acids, usually at least 22 amino acids, more usually at least 24 amino acids, preferably at least 26 amino acids, more preferably at least 28 amino acids, and, in particularly preferred embodiments, at least about 30 or more amino acids. Sequences of segments of different proteins can be compared to one another over appropriate length stretches.

Fragments may have ends which begin and/or end at virtually all positions, e.g., beginning at residues 1, 2, 3, etc., and ending at, e.g., the carboxy-terminus (N), N-1, N-2, etc., in all practical combinations of different lengths. Particularly interesting polypeptides have one or both ends corresponding to structural domain or motif boundaries, as described, or of the designated lengths with one end adjacent one of the described boundaries. In nucleic acid embodiments, often segments which encode such polypeptides would be of particular interest.

Amino acid sequence homology, or sequence identity, is determined by optimizing residue matches. In some comparisons, gaps may be introduces, as required. See, e.g., Needleham, et al. (1970) J. Mol. Biol. 48:443-453; Sankoff, et al. (1983) chapter one in Time Warps, String Edits, and Macromolecules: The Theory and Practice of Sequence Comparison, Addison-Wesley, Reading, Mass.; and software packages from IntelliGenetics, Mountain View, Calif.; and the University of Wisconsin Genetics Computer Group (GCG), Madison, Wis.; each of which is incorporated herein by reference. This analysis is especially important when considering conservative substitutions as matches. Conservative substitutions typically include substitutions within the following groups: glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. Homologous amino acid sequences are intended to include natural allelic and interspecies variations in the cytokine sequence. Typical homologous proteins or peptides will have from 50-100% homology (if gaps can be introduced), to 60-100% homology (if conservative substitutions are included) with an amino acid sequence segment of the appropriate SEQ ID NOs noted above. Homology measures will be at least about 70%, generally at least 76%, more generally at least 81%, often at least 85%, more often at least 88%, typically at least 90%, more typically at least 92%, usually at least 94%, more usually at least 95%, preferably at least 96%, and more preferably at least 97%, and in particularly preferred embodiments, at least 98% or more. The degree of homology will vary with the length of the compared segments. Homologous proteins or peptides, such as the allelic variants, will share most biological activities with the embodiments described individually, e.g., in the various tables.

As used herein, the term “biological activity” is used to describe, without limitation, effects on inflammatory responses, innate immunity, and/or morphogenic development by cytokine-like ligands. For example, the receptors typically should mediate phosphatase or phosphorylase activities, which activities are easily measured by standard procedures. See, e.g., Hardie, et al. (eds. 1995) The Protein Kinase FactBook vols. I and II, Academic Press, San Diego, Calif.; Hanks, et al. (1991) Meth. Enzymol. 200:38-62; Hunter, et al. (1992) Cell 70:375-388; Lewin (1990) Cell 61:743-752; Pines, et al. (1991) Cold Spring Harbor Symp. Quant. Biol. 56:449-463; and Parker, et al. (1993) Nature 363:736-738. The receptors, or portions thereof, may be useful as phosphate labeling enzymes to label general or specific substrates.

The terms ligand, agonist, antagonist, and analog of, e.g., a DIRS4_ include molecules that modulate the characteristic cellular responses to cytokine ligand proteins, as well as molecules possessing the more standard structural binding competition features of ligand-receptor interactions, e.g., where the receptor is a natural receptor or an antibody. The cellular responses likely are typically mediated through receptor tyrosine kinase pathways.

Also, a ligand is a molecule which serves either as a natural ligand to which said receptor, or an analog thereof, binds, or a molecule which is a functional analog of the natural ligand. The functional analog may be a ligand with structural modifications, or may be a wholly unrelated molecule which has a molecular shape which interacts with the appropriate ligand binding determinants. The ligands may serve as agonists or antagonists, see, e.g., Goodman, et al. (eds. 1990) Goodman & Gilman's: The Pharmacological Bases of Therapeutics, Pergamon Press, New York.

Rational drug design may also be based upon structural studies of the molecular shapes of a receptor or antibody and other effectors or ligands. See, e.g., Herz, et al. (1997) J. Recept. Signal Transduct. Res. 17:671-776; and Chaiken, et al. (1996) Trends Biotechnol. 14:369-375. Effectors may be other proteins which mediate other functions in response to ligand binding, or other proteins which normally interact with the receptor. One means for determining which sites interact with specific other proteins is a physical structure determination, e.g., x-ray crystallography or 2 dimensional NMR techniques. These will provide guidance as to which amino acid residues form molecular contact regions. For a detailed description of protein structural determination, see, e.g., Blundell and Johnson (1976) Protein Crystallography, Academic Press, New York, which is hereby incorporated herein by reference.

II. Activities

The cytokine receptor-like proteins will have a number of different biological activities, e.g., modulating cell proliferation, or in phosphate metabolism, being added to or removed from specific substrates, typically proteins. Such will generally result in modulation of an inflammatory function, other innate immunity response, or a morphological effect. The subunit will probably have a specific low affinity binding to the ligand.

Different receptors may mediate different signals. The TLR-L receptors may signal similar biology to the TLRs, which mediate fundamental innate immune or developmental responses. See, e.g., Aderem adn Ulevitch (2000) Nature 406:782-787. The TNFs and TGF are likely to signal as cytokines, as may the 5685C6, which seemingly is expressed by Th2 cells. The 5685C6 genes appear to be secreted proteins, which exhibit a cleavable signal sequence.

The claudins appear to be membrane proteins exhibiting 4 transmembrane segments, and seem to be associated with tight junctions and/or paracellular transport. They may also affect epithelial permeability or conductances, e.g., ion, across membranes. The claudin-D2 member of the claudin family is found to have regulated expression correlating with Crohn's disease. The other family members exhibit differential regulation in disease states, e.g., in Crohn's disease, ulcerative colitis, and various interstitial lung diseases. This is consistent with an important role in these disease processes. A functional role in the tight junctions/paracellular transport is consistent with problems in intestinal physiology.

Claudins define a structurally related multi-gene family of 4 TM proteins with distinct tissue distribution patterns. The claudins are major structural proteins of tight junctions (TJs) and can promote their formation. Their expression is necessary but not sufficient for tight junction formation. When expressed in fibroblasts, claudin-1 is capable of inducing a continuous association of adjacent cells, resulting in a cobblestone like pattern. However, this continuous barrier is not a tight junction. Claudins can be found outside of tight junction in certain cells. Claudin-3 and claudin-4 are receptors for Clostridium perfringens enterotoxin, a causative agent of fluid accumulation in the intestinal tract, causing diarrhea. Claudin-5 is deleted in Velo-cardio-facial syndrome (VCFS). Claudin-5 is only expressed in endothelial cells, and in some tissues it is even further restricted to arterials.

Mutations in Paracellin-1, claudin family member and a major renal tight junction protein, cause renal magnesium wasting with nephrocalcinosis. Thus, claudins may play important roles in selective paracellular conductance by determining the permeability of different epithelia.

The schlafens are members of a family of proteins of whose members are growth regulatory genes. See, e.g., Schwarz, et al. (1998) Immunity 9:657-668. These novel human sequences are related to the mouse Schlafen2 gene. It was observed to be differentially regulated in mouse IBD: Rag Hh+ (IL-10 treated) colon expression was higher than Rag Hh+alone and mimicked the expression of Rag Hh−.

The DIRS4 has the characteristic extracellular motifs of a receptor signaling through the JAK pathway. See, e.g., Ihle, et al. (1997) Stem Cells 15 (suppl. 1):105-111; Silvennoinen, et al. (1997) APMIS 105:497-509; Levy (1997) Cytokine Growth Factor Review 8:81-90; Winston and Hunter (1996) Current Biol. 6:668-671; Barrett (1996) Baillieres Clin. Gastroenterol. 10:1-15; and Briscoe, et al. (1996) Philos. Trans. R. Soc. Lond. B. Biol. Sci. 351:167-171.

The biological activities of the cytokine or other receptor subunits will be related to addition or removal of phosphate moieties to substrates, typically in a specific manner, but occasionally in a non specific manner. Substrates may be identified, or conditions for enzymatic activity may be assayed by standard methods, e.g., as described in Hardie, et al. (eds. 1995) The Protein Kinase FactBook vols. I and II, Academic Press, San Diego, Calif.; Hanks, et al. (1991) Meth. Enzymol. 200:38-62; Hunter, et al. (1992) Cell 70:375-388; Lewin (1990) Cell 61:743-752; Pines, et al. (1991) Cold Spring Harbor Symp. Quant. Biol. 56:449-463; and Parker, et al. (1993) Nature 363:736-738.

III. Nucleic Acids

This invention contemplates use of isolated nucleic acid or fragments, e.g., which encode these or closely related proteins, or fragments thereof, e.g., to encode a corresponding polypeptide, preferably one which is biologically active. In addition, this invention covers isolated or recombinant DNAs which encode such proteins or polypeptides having characteristic sequences of the DIRS4 or the other genes. Typically, the nucleic acid is capable of hybridizing, under appropriate conditions, with a nucleic acid sequence segment shown in the appropriate SEQ ID NOs noted above, but preferably not with other genes. Said biologically active protein or polypeptide can be a full length protein, or fragment, and will typically have a segment of amino acid sequence highly homologous, e.g., exhibiting significant stretches of identity, to ones described. Further, this invention covers the use of isolated or recombinant nucleic acid, or fragments thereof, which encode proteins having fragments which are equivalent to the described proteins. The isolated nucleic acids can have the respective regulatory sequences in the 5′ and 3′ flanks, e.g., promoters, enhancers, poly-A addition signals, and others from the natural gene.

An “isolated” nucleic acid is a nucleic acid, e.g., an RNA, DNA, or a mixed polymer, which is substantially pure, e.g., separated from other components which naturally accompany a native sequence, such as ribosomes, polymerases, and flanking genomic sequences from the originating species. The term embraces a nucleic acid sequence which has been removed from its naturally occurring environment, and includes recombinant or cloned DNA isolates, which are thereby distinguishable from naturally occurring compositions, and chemically synthesized analogs or analogs biologically synthesized by heterologous systems. A substantially pure molecule includes isolated forms of the molecule, either completely or substantially pure.

An isolated nucleic acid will generally be a homogeneous composition of molecules, but will, in some embodiments, contain heterogeneity, preferably minor. This heterogeneity is typically found at the polymer ends or portions not critical to a desired biological function or activity.

A “recombinant” nucleic acid is typically defined either by its method of production or its structure. In reference to its method of production, e.g., a product made by a process, the process is use of recombinant nucleic acid techniques, e.g., involving human intervention in the nucleotide sequence. Typically this intervention involves in vitro manipulation, although under certain circumstances it may involve more classical animal breeding techniques. Alternatively, it can be a nucleic acid made by generating a sequence comprising fusion of two fragments which are not naturally contiguous to each other, but is meant to exclude products of nature, e.g., naturally occurring mutants as found in their natural state. Thus, for example, products made by transforming cells with an unnaturally occurring vector is encompassed, as are nucleic acids comprising sequence derived using any synthetic oligonucleotide process. Such a process is often done to replace a codon with a redundant codon encoding the same or a conservative amino acid, while typically introducing or removing a restriction enzyme sequence recognition site. Alternatively, the process is performed to join together nucleic acid segments of desired functions to generate a single genetic entity comprising a desired combination of functions not found in the commonly available natural forms, e.g., encoding a fusion protein. Restriction enzyme recognition sites are often the target of such artificial manipulations, but other site specific targets, e.g., promoters, DNA replication sites, regulation sequences, control sequences, or other useful features may be incorporated by design. A similar concept is intended for a recombinant, e.g., fusion, polypeptide. This will include a dimeric repeat. Specifically included are synthetic nucleic acids which, by genetic code redundancy, encode equivalent polypeptides to fragments of the described sequences and fusions of sequences from various different related molecules, e.g., other cytokine receptor family members.

A “fragment” in a nucleic acid context is a contiguous segment of at least about 17 nucleotides, generally at least 21 nucleotides, more generally at least 25 nucleotides, ordinarily at least 30 nucleotides, more ordinarily at least 35 nucleotides, often at least 39 nucleotides, more often at least 45 nucleotides, typically at least 50 nucleotides, more typically at least 55 nucleotides, usually at least 60 nucleotides, more usually at least 66 nucleotides, preferably at least 72 nucleotides, more preferably at least 79 nucleotides, and in particularly preferred embodiments will be at least 85 or more nucleotides. Typically, fragments of different genetic sequences can be compared to one another over appropriate length stretches, particularly defined segments such as the domains described below.

A nucleic acid which codes for, e.g., a DIRS4, will be particularly useful to identify genes, mRNA, and cDNA species which code for itself or closely related proteins, as well as DNAs which code for polymorphic, allelic, or other genetic variants, e.g., from different individuals or related species. Other genes will be useful as markers for particular cell types, or diagnostic of various physiological conditions. Preferred probes for such screens may, in certain circumstances, be those regions of the gene which are conserved between different polymorphic variants or which contain nucleotides which lack specificity, and will preferably be full length or nearly so. In other situations, polymorphic variant specific sequences will be more useful.

This invention further covers recombinant nucleic acid molecules and fragments having a nucleic acid sequence identical to or highly homologous to the isolated DNA set forth herein. In particular, the sequences will often be operably linked to DNA segments which control transcription, translation, and DNA replication. Alternatively, recombinant clones derived from the genomic sequences, e.g., containing introns, will be useful for transgenic studies, including, e.g., transgenic cells and organisms, and for gene therapy. See, e.g., Goodnow (1992) “Transgenic Animals” in Roitt (ed.) Encyclopedia of Immunology Academic Press, San Diego, pp. 1502-1504; Travis (1992) Science 256:1392-1394; Kuhn, et al. (1991) Science 254:707-710; Capecchi (1989) Science 244:1288; Robertson (1987) (ed.) Teratocarcinomas and Embryonic Stem Cells: A Practical Approach IRL Press, Oxford; and Rosenberg (1992) J. Clinical Oncology 10:180-199. Operable association of heterologous promoters with natural gene sequences is also provided, as are vectors encoding, e.g., the DIRS4 with a receptor partner. See, e.g., Treco, et al. WO96/29411 or U.S. Ser. No. 08/406,030.

Homologous, or highly identical, nucleic acid sequences, when compared to one another, e.g., DIRS4 sequences, exhibit significant similarity. The standards for homology in nucleic acids are either measures for homology generally used in the art by sequence comparison or based upon hybridization conditions. Comparative hybridization conditions are described in greater detail below.

Substantial identity in the nucleic acid sequence comparison context means either that the segments, or their complementary strands, when compared, are identical when optimally aligned, with appropriate nucleotide insertions or deletions, in at least about 60% of the nucleotides, generally at least 66%, ordinarily at least 71%, often at least 76%, more often at least 80%, usually at least 84%, more usually at least 88%, typically at least 91%, more typically at least about 93%, preferably at least about 95%, more preferably at least about 96 to 98% or more, and in particular embodiments, as high at about 99% or more of the nucleotides, including, e.g., segments encoding structural domains such as the segments described below. Alternatively, substantial identity will exist when the segments will hybridize under selective hybridization conditions, to a strand or its complement, typically using a described sequence. Typically, selective hybridization will occur when there is at least about 55% homology over a stretch of at least about 14 nucleotides, more typically at least about 65%, preferably at least about 75%, and more preferably at least about 90%. See, Kanehisa (1984) Nucl. Acids Res. 12:203-213, which is incorporated herein by reference. The length of homology comparison, as described, may be over longer stretches, and in certain embodiments will be over a stretch of at least about 17 nucleotides, generally at least about 20 nucleotides, ordinarily at least about 24 nucleotides, usually at least about 28 nucleotides, typically at least about 32 nucleotides, more typically at least about 40 nucleotides, preferably at least about 50 nucleotides, and more preferably at least about 75 to 100 or more nucleotides. This includes, e.g., 125, 150, 175, 200, 225, 250, 275, 300, 400, 500, 700, 900, and other lengths.

Stringent conditions, in referring to homology in the hybridization context, will be stringent combined conditions of salt, temperature, organic solvents, and other parameters typically controlled in hybridization reactions. Stringent temperature conditions will usually include temperatures in excess of about 30° C., more usually in excess of about 37° C., typically in excess of about 45° C., more typically in excess of about 55° C., preferably in excess of about 65° C., and more preferably in excess of about 70° C. Stringent salt conditions will ordinarily be less than about 500 mM, usually less than about 400 mM, more usually less than about 300 mM, typically less than about 200 mM, preferably less than about 100 mM, and more preferably less than about 80 mM, even down to less than about 20 mM. However, the combination of parameters is much more important than the measure of any single parameter. See, e.g., Wetmur and Davidson (1968) J. Mol. Biol. 31:349-370, which is hereby incorporated herein by reference.

The isolated DNA can be readily modified by nucleotide substitutions, nucleotide deletions, nucleotide insertions, and inversions of nucleotide stretches. These modifications result in novel DNA sequences which encode this protein or its derivatives. These modified sequences can be used to produce mutant proteins (muteins) or to enhance the expression of variant species. Enhanced expression may involve gene amplification, increased transcription, increased translation, and other mechanisms. Such mutant derivatives include predetermined or site-specific mutations of the protein or its fragments, including silent mutations using genetic code degeneracy. “Mutant DIRS4” as used herein encompasses a polypeptide otherwise falling within the homology definition of the DIRS4 as set forth above, but having an amino acid sequence which differs from that of other cytokine receptor-like proteins as found in nature, whether by way of deletion, substitution, or insertion. In particular, “site specific mutant DIRS4” encompasses a protein having substantial sequence identity with a protein of SEQ ID NO:2, and typically shares most of the biological activities or effects of the forms disclosed herein.

Although site specific mutation sites are predetermined, mutants need not be site specific. Mammalian DIRS4 mutagenesis can be achieved by making amino acid insertions or deletions in the gene, coupled with expression. Substitutions, deletions, insertions, or many combinations may be generated to arrive at a final construct. Insertions include amino- or carboxy-terminal fusions. Random mutagenesis can be conducted at a target codon and the expressed mammalian DIRS4 mutants can then be screened for the desired activity, providing some aspect of a structure-activity relationship. Methods for making substitution mutations at predetermined sites in DNA having a known sequence are well known in the art, e.g., by M13 primer mutagenesis. See also Sambrook, et al. (1989) and Ausubel, et al. (1987 and periodic Supplements).

The mutations in the DNA normally should not place coding sequences out of reading frames and preferably will not create complementary regions that could hybridize to produce secondary mRNA structure such as loops or hairpins.

The phosphoramidite method described by Beaucage and Carruthers (1981) Tetra. Letts. 22:1859-1862, will produce suitable synthetic DNA fragments. A double stranded fragment will often be obtained either by synthesizing the complementary strand and annealing the strand together under appropriate conditions or by adding the complementary strand using DNA polymerase with an appropriate primer sequence.

Polymerase chain reaction (PCR) techniques can often be applied in mutagenesis. Alternatively, mutagenesis primers are commonly used methods for generating defined mutations at predetermined sites. See, e.g., Innis, et al. (eds. 1990) PCR Protocols: A Guide to Methods and Applications Academic Press, San Diego, Calif.; and Dieffenbach and Dveksler (1995; eds.) PCR Primer: A Laboratory Manual Cold Spring Harbor Press, CSH, NY.

Antisense and other technologies for blocking expression of these genes are also available. See, e.g., Misquitta and Paterson (1999) Proc. Nat'l Acad. Sci. USA 96:1451-1456.

IV. Proteins, Peptides

As described above, the present invention encompasses primate DIRS4, e.g., whose sequences are disclosed in SEQ ID NO:2, and described above. Allelic and other variants are also contemplated, including, e.g., fusion proteins combining portions of such sequences with others, including epitope tags and functional domains. Analogous methods and applications exist directed to the other genes described herein.

The present invention also provides recombinant proteins, e.g., heterologous fusion proteins using segments from these proteins. A heterologous fusion protein is a fusion of proteins or segments which are naturally not normally fused in the same manner. Thus, e.g., the fusion product of a DIRS4 with another cytokine receptor is a continuous protein molecule having sequences fused in a typical peptide linkage, typically made as a single translation product and exhibiting properties, e.g., sequence or antigenicity, derived from each source peptide. A similar concept applies to heterologous nucleic acid sequences.

In addition, new constructs may be made from combining similar functional or structural domains from other related proteins, e.g., cytokine receptors or Toll-like receptor like genes, including species variants. For example, ligand-binding or other segments may be “swapped” between different new fusion polypeptides or fragments. See, e.g., Cunningham, et al. (1989) Science 243:1330-1336; and O'Dowd, et al. (1988) J. Biol. Chem. 263:15985-15992, each of which is incorporated herein by reference. Thus, new chimeric polypeptides exhibiting new combinations of specificities will result from the functional linkage of receptor-binding specificities. For example, the ligand binding domains from other related receptor molecules may be added or substituted for other domains of this or related proteins. The resulting protein will often have hybrid function and properties. For example, a fusion protein may include a targeting domain which may serve to provide sequestering of the fusion protein to a particular subcellular organelle.

Candidate fusion partners and sequences can be selected from various sequence data bases, e.g., GenBank, do IntelliGenetics, Mountain View, Calif.; and BCG, University of Wisconsin Biotechnology Computing Group, Madison, Wis., which are each incorporated herein by reference.

The present invention particularly provides muteins which bind cytokine-like ligands, and/or which are affected in signal transduction. Structural alignment of human DIRS4 with other members of the cytokine receptor family show conserved features/residues. Alignment of the human DIRS4 sequence with other members of the cytokine receptor family indicates various structural and functionally shared features. See also, Bazan, et al. (1996) Nature 379:591; Lodi, et al. (1994) Science 263:1762-1766; Sayle and Milner-White (1995) TIBS 20:374-376; and Gronenberg, et al. (1991) Protein Engineering 4:263-269. Similarly, the other genes have related family members.

Substitutions with either mouse sequences or human sequences are particularly preferred. Conversely, conservative substitutions away from the ligand binding interaction regions will probably preserve most signaling activities; and conservative substitutions away from the intracellular domains will probably preserve most ligand binding properties.

“Derivatives” of the various proteins include amino acid sequence mutants, glycosylation variants, metabolic derivatives, and covalent or aggregative conjugates with other chemical moieties. Covalent derivatives can be prepared by linkage of functionalities to groups which are found in amino acid side chains or at the N- or C-termini, e.g., by means which are well known in the art. These derivatives can include, without limitation, aliphatic esters or amides of the carboxyl terminus, or of residues containing carboxyl side chains, O-acyl derivatives of hydroxyl group-containing residues, and N-acyl derivatives of the amino terminal amino acid or amino-group containing residues, e.g., lysine or arginine. Acyl groups are selected from the group of alkyl-moieties, including C3 to C18 normal alkyl, thereby forming alkanoyl aroyl species.

In particular, glycosylation alterations are included, e.g., made by modifying the glycosylation patterns of a polypeptide during its synthesis and processing, or in further processing steps. Particularly preferred means for accomplishing this are by exposing the polypeptide to glycosylating enzymes derived from cells which normally provide such processing, e.g., mammalian glycosylation enzymes. Deglycosylation enzymes are also contemplated. Also embraced are versions of the same primary amino acid sequence which have other minor modifications, including phosphorylated amino acid residues, e.g., phosphotyrosine, phosphoserine, or phosphothreonine.

A major group of derivatives are covalent conjugates of the proteins or fragments thereof with other proteins of polypeptides. These derivatives can be synthesized in recombinant culture such as N- or C-terminal fusions or by the use of agents known in the art for their usefulness in cross-linking proteins through reactive side groups. Preferred derivatization sites with cross-linking agents are at free amino groups, carbohydrate moieties, and cysteine residues.

Fusion polypeptides between the proteins and other homologous or heterologous proteins are also provided. Homologous polypeptides may be fusions between different proteins, resulting in, for instance, a hybrid protein exhibiting binding specificity for multiple different cytokine ligands, or a receptor which may have broadened or weakened specificity of substrate effect. Likewise, heterologous fusions may be constructed which would exhibit a combination of properties or activities of the derivative proteins. Typical examples are fusions of a reporter polypeptide, e.g., luciferase, with a segment or domain of a receptor, e.g., a ligand-binding segment, so that the presence or location of a desired ligand may be easily determined. See, e.g., Dull, et al., U.S. Pat. No. 4,859,609, which is hereby incorporated herein by reference. Other gene fusion partners include glutathione-S-transferase (GST), bacterial β-galactosidase, trpE, Protein A, β-lactamase, alpha amylase, alcohol dehydrogenase, and yeast alpha mating factor. See, e.g., Godowski, et al. (1988) Science 241:812-816.

The phosphoramidite method described by Beaucage and Carruthers (1981) Tetra. Letts. 22:1859-1862, will produce suitable synthetic DNA fragments. A double stranded fragment will often be obtained either by synthesizing the complementary strand and annealing the strand together under appropriate conditions or by adding the complementary strand using DNA polymerase with an appropriate primer sequence.

Such polypeptides may also have amino acid residues which have been chemically modified by phosphorylation, sulfonation, biotinylation, or the addition or removal of other moieties, particularly those which have molecular shapes similar to phosphate groups. In some embodiments, the modifications will be useful labeling reagents, or serve as purification targets, e.g., affinity ligands.

Fusion proteins will typically be made by either recombinant nucleic acid methods or by synthetic polypeptide methods. Techniques for nucleic acid manipulation and expression are described generally, for example, in Sambrook, et al. (1989) Molecular Cloning: A Laboratory Manual (2d ed.), Vols. 1-3, Cold Spring Harbor Laboratory, and Ausubel, et al. (eds. 1987 and periodic supplements) Current Protocols in Molecular Biology, Greene/Wiley, New York, which are each incorporated herein by reference. Techniques for synthesis of polypeptides are described, for example, in Merrifield (1963) J. Amer. Chem. Soc. 85:2149-2156; Merrifield (1986) Science 232: 341-347; and Atherton, et al. (1989) Solid Phase Peptide Synthesis: A Practical Approach, IRL Press, Oxford; each of which is incorporated herein by reference. See also Dawson, et al. (1994) Science 266:776-779 for methods to make larger polypeptides.

This invention also contemplates the use of derivatives of these proteins other than variations in amino acid sequence or glycosylation. Such derivatives may involve covalent or aggregative association with chemical moieties. These derivatives generally fall into three classes: (1) salts, (2) side chain and terminal residue covalent modifications, and (3) adsorption complexes, for example with cell membranes. Such covalent or aggregative derivatives are useful as immunogens, as reagents in immunoassays, or in purification methods such as for affinity purification of a receptor or other binding molecule, e.g., an antibody. For example, a cytokine ligand can be immobilized by covalent bonding to a solid support such as cyanogen bromide-activated Sepharose, by methods which are well known in the art, or adsorbed onto polyolefin surfaces, with or without glutaraldehyde cross-linking, for use in the assay or purification of an cytokine receptor, antibodies, or other similar molecules. The ligand can also be labeled with a detectable group, for example radioiodinated by the chloramine T procedure, covalently bound to rare earth chelates, or conjugated to another fluorescent moiety for use in diagnostic assays.

A polypeptide of this invention can be used as an immunogen for the production of antisera or antibodies. These may be specific, e.g., capable of detecting or distinguishing between other related family members or various fragments thereof. The purified proteins can be used to screen monoclonal antibodies or antigen-binding fragments prepared by immunization with various forms of impure preparations containing the protein. In particular, the term “antibodies” also encompasses antigen binding fragments of natural antibodies, e.g., Fab, Fab2, Fv, etc. The purified proteins can also be used as a reagent to detect antibodies generated in response to the presence of elevated levels of expression, or immunological disorders which lead to antibody production to the endogenous receptor. Additionally, fragments may also serve as immunogens to produce the antibodies of the present invention. For example, this invention contemplates antibodies having binding affinity to or being raised against the amino acid sequences provided, fragments thereof, or various homologous peptides. In particular, this invention contemplates antibodies having binding affinity to, or having been raised against, specific fragments which are predicted to be, or actually are, exposed at the exterior protein surfaces.

The blocking of physiological response to the receptor ligands may result from the inhibition of binding of the ligand to the receptor, likely through competitive inhibition. Antibodies to ligands may be antagonists. Thus, in vitro assays of the present invention will often use antibodies or antigen binding segments of these antibodies, or fragments attached to solid phase substrates. Assays will also allow for the diagnostic determination of the effects of mutations and modifications, e.g., which affect signaling or enzymatic function.

This invention also contemplates the use of competitive drug screening assays, e.g., where neutralizing antibodies to the receptor or fragments compete with a test compound for binding to a ligand or other antibody. In this manner, the neutralizing antibodies or fragments can be used to detect the presence of a polypeptide which shares one or more binding sites to a receptor and can also be used to occupy binding sites on a receptor that might otherwise bind a ligand.

V. Making Nucleic Acids and Protein

DNA which encodes the protein or fragments thereof can be obtained by chemical synthesis, screening cDNA libraries, or by screening genomic libraries prepared from a wide variety of cell lines or tissue samples. Natural sequences can be isolated using standard methods and the sequences provided herein. Other species counterparts can be identified by hybridization techniques, or by various PCR techniques, or combined with or by searching in sequence databases, e.g., GenBank.

This DNA can be expressed in a wide variety of host cells which can, in turn, e.g., be used to generate polyclonal or monoclonal antibodies; for binding studies; for construction and expression of modified constructs; and for structure/function studies. Variants or fragments can be expressed in host cells that are transformed or transfected with appropriate expression vectors. These molecules can be substantially free of protein or cellular contaminants, other than those derived from the recombinant host, and therefore are particularly useful in pharmaceutical compositions when combined with a pharmaceutically acceptable carrier and/or diluent. The protein, or portions thereof, may be expressed as fusions with other proteins.

Expression vectors are typically self-replicating DNA or RNA constructs containing the desired receptor gene or its fragments, usually operably linked to suitable genetic control elements that are recognized in a suitable host cell. These control elements are capable of effecting expression within a suitable host. The specific type of control elements necessary to effect expression will depend upon the eventual host cell used. Generally, the genetic control elements can include a prokaryotic promoter system or a eukaryotic promoter expression control system, and typically include a transcriptional promoter, an optional operator to control the onset of transcription, transcription enhancers to elevate the level of mRNA expression, a sequence that encodes a suitable ribosome binding site, and sequences that terminate transcription and translation. Expression vectors also usually contain an origin of replication that allows the vector to replicate independently of the host cell.

The vectors of this invention include those which contain DNA which encodes a protein, as described, or a fragment thereof encoding a biologically active equivalent polypeptide. The DNA can be under the control of a viral promoter and can encode a selection marker. This invention further contemplates use of such expression vectors which are capable of expressing eukaryotic cDNA coding for such a protein in a prokaryotic or eukaryotic host, where the vector is compatible with the host and where the eukaryotic cDNA coding for the receptor is inserted into the vector such that growth of the host containing the vector expresses the cDNA in question. Usually, expression vectors are designed for stable replication in their host cells or for amplification to greatly increase the total number of copies of the desirable gene per cell. It is not always necessary to require that an expression vector replicate in a host cell, e.g., it is possible to effect transient expression of the protein or its fragments in various hosts using vectors that do not contain a replication origin that is recognized by the host cell. It is also possible to use vectors that cause integration of the protein encoding portion or its fragments into the host DNA by recombination.

Vectors, as used herein, comprise plasmids, viruses, bacteriophage, integratable DNA fragments, and other vehicles which enable the integration of DNA fragments into the genome of the host. Expression vectors are specialized vectors which contain genetic control elements that effect expression of operably linked genes. Plasmids are the most commonly used form of vector but all other forms of vectors which serve an equivalent function and which are, or become, known in the art are suitable for use herein. See, e.g., Pouwels, et al. (1985 and Supplements) Cloning Vectors: A Laboratory Manual, Elsevier, N.Y., and Rodriguez, et al. (eds. 1988) Vectors: A Survey of Molecular Cloning Vectors and Their Uses, Buttersworth, Boston, which are incorporated herein by reference.

Transformed cells are cells, preferably mammalian, that have been transformed or transfected with receptor vectors constructed using recombinant DNA techniques. Transformed host cells usually express the desired protein or its fragments, but for purposes of cloning, amplifying, and manipulating its DNA, do not need to express the subject protein. This invention further contemplates culturing transformed cells in a nutrient medium, thus permitting the receptor to accumulate in the cell membrane. The protein can be recovered, either from the culture or, in certain instances, from the culture medium.

For purposes of this invention, nucleic sequences are operably linked when they are functionally related to each other. For example, DNA for a presequence or secretory leader is operably linked to a polypeptide if it is expressed as a preprotein or participates in directing the polypeptide to the cell membrane or in secretion of the polypeptide. A promoter is operably linked to a coding sequence if it controls the transcription of the polypeptide; a ribosome binding site is operably linked to a coding sequence if it is positioned to permit translation. Usually, operably linked means contiguous and in reading frame, however, certain genetic elements such as repressor genes are not contiguously linked but still bind to operator sequences that in turn control expression.

Suitable host cells include prokaryotes, lower eukaryotes, and higher eukaryotes. Prokaryotes include both gram negative and gram positive organisms, e.g., E. coli and B. subtilis. Lower eukaryotes include yeasts, e.g., S. cerevisiae and Pichia, and species of the genus Dictyostelium. Higher eukaryotes include established tissue culture cell lines from animal cells, both of non-mammalian origin, e.g., insect cells, and birds, and of mammalian origin, e.g., human, primates, and rodents.

Prokaryotic host-vector systems include a wide variety of vectors for many different species. As used herein, E. coli and its vectors will be used generically to include equivalent vectors used in other prokaryotes. A representative vector for amplifying DNA is pBR322 or many of its derivatives. Vectors that can be used to express the receptor or its fragments include, but are not limited to, such vectors as those containing the lac promoter (pUC-series); trp promoter (pBR322-trp); Ipp promoter (the pIN-series); lambda-pP or pR promoters (pOTS); or hybrid promoters such as ptac (pDR540). See Brosius, et al. (1988) “Expression Vectors Employing Lambda-, trp-, lac-, and Ipp-derived Promoters”, in Vectors: A Survey of Molecular Cloning Vectors and Their Uses, (eds. Rodriguez and Denhardt), Buttersworth, Boston, Chapter 10, pp. 205-236, which is incorporated herein by reference.

Lower eukaryotes, e.g., yeasts and Dictyostelium, may be transformed with DIRS4 sequence containing vectors. For purposes of this invention, the most common lower eukaryotic host is the baker's yeast, Saccharomyces cerevisiae. It will be used to generically represent lower eukaryotes although a number of other strains and species are also available. Yeast vectors typically consist of a replication origin (unless of the integrating type), a selection gene, a promoter, DNA encoding the receptor or its fragments, and sequences for translation termination, polyadenylation, and transcription termination. Suitable expression vectors for yeast include such constitutive promoters as 3-phosphoglycerate kinase and various other glycolytic enzyme gene promoters or such inducible promoters as the alcohol dehydrogenase 2 promoter or metallothionine promoter. Suitable vectors include derivatives of the following types: self-replicating low copy number (such as the YRp-series), self-replicating high copy number (such as the YEp-series); integrating types (such as the YIp-series), or mini-chromosomes (such as the YCp-series).

Higher eukaryotic tissue culture cells are normally the preferred host cells for expression of the functionally active interleukin protein. In principle, many higher eukaryotic tissue culture cell lines are workable, e.g., insect baculovirus expression systems, whether from an invertebrate or vertebrate source. However, mammalian cells are preferred. Transformation or transfection and propagation of such cells has become a routine procedure. Examples of useful cell lines include HeLa cells, Chinese hamster ovary (CHO) cell lines, baby rat kidney (BRK) cell lines, insect cell lines, bird cell lines, and monkey (COS) cell lines. Expression vectors for such cell lines usually include an origin of replication, a promoter, a translation initiation site, RNA splice sites (if genomic DNA is used), a polyadenylation site, and a transcription termination site. These vectors also usually contain a selection gene or amplification gene. Suitable expression vectors may be plasmids, viruses, or retroviruses carrying promoters derived, e.g., from such sources as from adenovirus, SV40, parvoviruses, vaccinia virus, or cytomegalovirus. Representative examples of suitable expression vectors include pcDNA1; pCD, see Okayama, et al. (1985) Mol. Cell. Biol. 5:1136-1142; pMC1neo PolyA, see Thomas, et al. (1987) Cell 51:503-512; and a baculovirus vector such as pAC 373 or pAC 610.

For secreted proteins, an open reading frame usually encodes a polypeptide that consists of a mature or secreted product covalently linked at its N-terminus to a signal peptide. The signal peptide is cleaved prior to secretion of the mature, or active, polypeptide. The cleavage site can be predicted with a high degree of accuracy from empirical rules, e.g., von-Heijne (1986) Nucleic Acids Research 14:4683-4690 and Nielsen, et al. (1997) Protein Eng. 10:1-12, and the precise amino acid composition of the signal peptide often does not appear to be critical to its function, e.g., Randall, et al. (1989) Science 243:1156-1159; Kaiser et al. (1987) Science 235:312-317.

It will often be desired to express these polypeptides in a system which provides a specific or defined glycosylation pattern. In this case, the usual pattern will be that provided naturally by the expression system. However, the pattern will be modifiable by exposing the polypeptide, e.g., an unglycosylated form, to appropriate glycosylating proteins introduced into a heterologous expression system. For example, the gene may be co-transformed with one or more genes encoding mammalian or other glycosylating enzymes. Using this approach, certain mammalian glycosylation patterns will be achievable in prokaryote or other cells.

The source of protein can be a eukaryotic or prokaryotic host expressing recombinant gene, such as is described above. The source can also be a cell line such as mouse Swiss 3T3 fibroblasts, but other mammalian cell lines are also contemplated by this invention, with the preferred cell line being from the human species.

Now that the sequences are known, the primate protein, fragments, or derivatives thereof can be prepared by conventional processes for synthesizing peptides. These include processes such as are described in Stewart and Young (1984) Solid Phase Peptide Synthesis, Pierce Chemical Co., Rockford, Ill.; Bodanszky and Bodanszky (1984) The Practice of Peptide Synthesis, Springer-Verlag, New York; and Bodanszky (1984) The Principles of Peptide Synthesis, Springer-Verlag, New York; all of each which are incorporated herein by reference. For example, an azide process, an acid chloride process, an acid anhydride process, a mixed anhydride process, an active ester process (for example, p-nitrophenyl ester, N-hydroxysuccinimide ester, or cyanomethyl ester), a carbodiimidazole process, an oxidative-reductive process, or a dicyclohexylcarbodiimide (DCCD)/additive process can be used. Solid phase and solution phase syntheses are both applicable to the foregoing processes. Similar techniques can be used with partial polypeptide sequences.

The various proteins, fragments, or derivatives are suitably prepared in accordance with the above processes as typically employed in peptide synthesis, generally either by a so-called stepwise process which comprises condensing an amino acid to the terminal amino acid, one by one in sequence, or by coupling peptide fragments to the terminal amino acid. Amino groups that are not being used in the coupling reaction typically must be protected to prevent coupling at an incorrect location.

If a solid phase synthesis is adopted, the C-terminal amino acid is bound to an insoluble carrier or support through its carboxyl group. The insoluble carrier is not particularly limited as long as it has a binding capability to a reactive carboxyl group. Examples of such insoluble carriers include halomethyl resins, such as chloromethyl resin or bromomethyl resin, hydroxymethyl resins, phenol resins, tert-alkyloxycarbonylhydrazidated resins, and the like.

An amino group-protected amino acid is bound in sequence through condensation of its activated carboxyl group and the reactive amino group of the previously formed peptide or chain, to synthesize the peptide step by step. After synthesizing the complete sequence, the peptide is split off from the insoluble carrier to produce the peptide. This solid-phase approach is generally described by Merrifield, et al. (1963) in J. Am. Chem. Soc. 85:2149-2156, which is incorporated herein by reference.

The prepared protein and fragments thereof can be isolated and purified from the reaction mixture by means of peptide separation, e.g., by extraction, precipitation, electrophoresis, various forms of chromatography, and the like. The proteins of this invention can be obtained in varying degrees of purity depending upon desired uses. Purification can be accomplished by use of the protein purification techniques disclosed herein, see below, or by the use of the antibodies herein described in methods of immunoabsorbant affinity chromatography. This immunoabsorbant affinity chromatography is carried out by first linking the antibodies to a solid support and then contacting the linked antibodies with solubilized lysates of appropriate cells, lysates of other cells expressing the receptor, or lysates or supernatants of cells producing the protein as a result of DNA techniques, see below.

Generally, the purified protein will be at least about 40% pure, ordinarily at least about 50% pure, usually at least about 60% pure, typically at least about 70% pure, more typically at least about 80% pure, preferable at least about 90% pure and more preferably at least about 95% pure, and in particular embodiments, 97%-99% or more. Purity will usually be on a weight basis, but can also be on a molar basis. Different assays will be applied as appropriate.

VI. Antibodies

Antibodies can be raised to the various mammalian, e.g., primate DIRS4, proteins and fragments thereof, both in naturally occurring native forms and in their recombinant forms, the difference being that antibodies to the active receptor are more likely to recognize epitopes which are only present in the native conformations. Denatured antigen detection can also be useful in, e.g., Western analysis. Anti-idiotypic antibodies are also contemplated, which would be useful as agonists or antagonists of a natural receptor or an antibody.

Antibodies, including binding fragments and single chain versions, against predetermined fragments of the protein can be raised by immunization of animals with conjugates of the fragments with immunogenic proteins. Monoclonal antibodies are prepared from cells secreting the desired antibody. These antibodies can be screened for binding to normal or defective protein, or screened for agonistic or antagonistic activity. These monoclonal antibodies will usually bind with at least a K_(D) of about 1 mM, more usually at least about 300 μM, typically at least about 100 μM, more typically at least about 30 μM, preferably at least about 10 μM, and more preferably at least about 3 μM or better.

The antibodies, including antigen binding fragments, of this invention can have significant diagnostic or therapeutic value. They can be potent agonists or antagonists, e.g., that bind to the receptor and inhibit or simulate binding to ligand, or inhibit the ability of the receptor to elicit a biological response, e.g., act on its substrate. They also can be useful as non-neutralizing antibodies or for use as markers for detection or diagnosis, and can be coupled to toxins or radionuclides to bind producing cells. Further, these antibodies can be conjugated to drugs or other therapeutic agents, either directly or indirectly by means of a linker.

The antibodies of this invention can also be useful in diagnostic applications. As capture or non-neutralizing antibodies, they might bind to the antigen without inhibiting, e.g., ligand or substrate binding. As neutralizing antibodies, they can be useful in competitive binding assays. They will also be useful in detecting or quantifying antigen. They may be used as reagents for Western blot analysis, or for immunoprecipitation or immunopurification of the respective protein.

Protein fragments may be joined to other materials, particularly polypeptides, as fused or covalently joined polypeptides to be used as immunogens. Mammalian cytokine receptors, cytokines, enzymes, marker proteins, and fragments may be fused or covalently linked to a variety of immunogens, such as keyhole limpet hemocyanin, bovine serum albumin, tetanus toxoid, etc. See Microbiology, Hoeber Medical Division, Harper and Row, 1969; Landsteiner (1962) Specificity of Serological Reactions, Dover Publications, New York; and Williams, et al. (1967) Methods in Immunology and Immunochemistry, Vol. 1, Academic Press, New York; each of which are incorporated herein by reference, for descriptions of methods of preparing polyclonal antisera. A typical method involves hyperimmunization of an animal with an antigen. The blood of the animal is then collected shortly after the repeated immunizations and the gamma globulin is isolated.

In some instances, it is desirable to prepare monoclonal antibodies from various mammalian hosts, such as mice, rodents, primates, humans, etc. Description of techniques for preparing such monoclonal antibodies may be found in, e.g., Stites, et al. (eds.) Basic and Clinical Immunology (4th ed.), Lange Medical Publications, Los Altos, Calif., and references cited therein; Harlow and Lane (1988) Antibodies: A Laboratory Manual, CSH Press; Goding (1986) Monoclonal Antibodies: Principles and Practice (2d ed.) Academic Press, New York; and particularly in Kohler and Milstein (1975) in Nature 256: 495-497, which discusses one method of generating monoclonal antibodies. Summarized briefly, this method involves injecting an animal with an immunogen. The animal is then sacrificed and cells taken from its spleen, which are then fused with myeloma cells. The result is a hybrid cell or “hybridoma” that is capable of reproducing in vitro. The population of hybridomas is then screened to isolate individual clones, each of which secrete a single antibody species to the immunogen. In this manner, the individual antibody species obtained are the products of immortalized and cloned single B cells from the immune animal generated in response to a specific site recognized on the immunogenic substance.

Other suitable techniques involve in vitro exposure of lymphocytes to the antigenic polypeptides or alternatively to selection of libraries of antibodies in phage or similar vectors. See, Huse, et al. (1989) “Generation of a Large Combinatorial Library of the Immunoglobulin Repertoire in Phage Lambda,” Science 246:1275-1281; and Ward, et al. (1989) Nature 341:544-546. The polypeptides and antibodies of the present invention may be used with or without modification, including chimeric or humanized antibodies. Frequently, the polypeptides and antibodies will be labeled by joining, either covalently or non-covalently, a substance which provides for a detectable signal. A wide variety of labels and conjugation techniques are known and are reported extensively in both the scientific and patent literature. Suitable labels include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent moieties, chemiluminescent moieties, magnetic particles, and the like. Patents, teaching the use of such labels include U.S. Pat. Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 4,366,241. Also, recombinant or chimeric immunoglobulins may be produced, see Cabilly, U.S. Pat. No. 4,816,567; or made in transgenic mice, see Mendez, et al. (1997) Nature Genetics 15:146-156.

The antibodies of this invention can also be used for affinity chromatography in isolating the proteins or peptides. Columns can be prepared where the antibodies are linked to a solid support, e.g., particles, such as agarose, Sephadex, or the like, where a cell lysate may be passed through the column, the column washed, followed by increasing concentrations of a mild denaturant, whereby the purified protein will be released. Conversely, the protein may be used to purify antibody by immunoselection.

The antibodies may also be used to screen expression libraries for particular expression products. Usually the antibodies used in such a procedure will be labeled with a moiety allowing easy detection of presence of antigen by antibody binding.

Antibodies raised against a protein will also be used to raise anti-idiotypic antibodies. These will be useful in detecting or diagnosing various immunological conditions related to expression of the protein or cells which express the protein. They also will be useful as agonists or antagonists of a ligand, which may be competitive inhibitors or substitutes for naturally occurring ligands.

A target protein that specifically binds to or that is specifically immunoreactive with an antibody generated against it, such as an immunogen consisting of a described amino acid sequence, is typically determined in an immunoassay. The immunoassay typically uses a polyclonal antiserum which was raised, e.g., to a protein of SEQ ID NO: 2. This antiserum is selected to have low crossreactivity against other cytokine receptor family members, e.g., IFN receptor subunits, preferably from the same species, and any such crossreactivity is removed by immunoabsorption prior to use in the immunoassay.

In order to produce antisera for use in an immunoassay, the protein, e.g., of SEQ ID NO: 2, is isolated as described herein. For example, recombinant protein may be produced in a mammalian cell line. An appropriate host, e.g., an inbred strain of mice such as Balb/c, is immunized with the selected protein, typically using a standard adjuvant, such as Freund's adjuvant, and a standard mouse immunization protocol (see Harlow and Lane, supra). Alternatively, a synthetic peptide derived from the sequences disclosed herein and conjugated to a carrier protein can be used an immunogen. Polyclonal sera are collected and titered against the immunogen protein in an immunoassay, e.g., a solid phase immunoassay with the immunogen immobilized on a solid support. Polyclonal antisera with a titer of 10⁴ or greater are selected and tested for their cross reactivity against other cytokine receptor family members, e.g., receptors aligned in FIG. 1, using a competitive binding immunoassay such as the one described in Harlow and Lane, supra, at pages 570-573. Preferably at least two cytokine receptor family members are used in this determination. These cytokine receptor family members can be produced as recombinant proteins and isolated using standard molecular biology and protein chemistry techniques as described herein.

Immunoassays in the competitive binding format can be used for the crossreactivity determinations. For example, the protein of SEQ ID NO: 2 can be immobilized to a solid support. Proteins added to the assay compete with the binding of the antisera to the immobilized antigen. The ability of the above proteins to compete with the binding of the antisera to the immobilized protein is compared to selected other receptor subunits. The percent crossreactivity for the above proteins is calculated, using standard calculations. Those antisera with less than 10% crossreactivity with each of the proteins listed above are selected and pooled. The cross-reacting antibodies are then removed from the pooled antisera by immunoabsorption with the above-listed proteins.

The immunoabsorbed and pooled antisera are then used in a competitive binding immunoassay as described above to compare a second protein to the immunogen protein. In order to make this comparison, the two proteins are each assayed at a wide range of concentrations and the amount of each protein required to inhibit 50% of the binding of the antisera to the immobilized protein is determined. If the amount of the second protein required is less than twice the amount of the protein of the selected protein or proteins that is required, then the second protein is said to specifically bind to an antibody generated to the immunogen.

It is understood that these proteins are members of families of homologous proteins. For a particular gene product, such as the DIRS4, the term refers not only to the amino acid sequences disclosed herein, but also to other proteins that are allelic, non-allelic, or species variants. It is also understood that the terms include nonnatural mutations introduced by deliberate mutation using conventional recombinant technology such as single site mutation, or by excising short sections of DNA encoding the respective proteins, or by substituting new amino acids, or adding new amino acids. Such minor alterations typically will substantially maintain the immunoidentity of the original molecule and/or its biological activity. Thus, these alterations include proteins that are specifically immunoreactive with a designated naturally occurring DIRS4 protein. The biological properties of the altered proteins can be determined by expressing the protein in an appropriate cell line and measuring the appropriate effect, e.g., upon transfected lymphocytes. Particular protein modifications considered minor would include conservative substitution of amino acids with similar chemical properties, as described above for the cytokine receptor family as a whole. By aligning a protein optimally with the protein of the cytokine receptors and by using the conventional immunoassays described herein to determine immunoidentity, one can determine the protein compositions of the invention.

VII. Kits and Quantitation

Both naturally occurring and recombinant forms of the molecules of this invention are particularly useful in kits and assay methods. For example, these methods would also be applied to screening for binding activity, e.g., ligands or receptors for these proteins. Several methods of automating assays have been developed in recent years so as to permit screening of tens of thousands of compounds per year. See, e.g., a BIOMEK automated workstation, Beckman Instruments, Palo Alto, Calif., and Fodor, et al. (1991) Science 251:767-773, which is incorporated herein by reference. The latter describes means for testing binding by a plurality of defined polymers synthesized on a solid substrate. The development of suitable assays to screen for a ligand or agonist/antagonist homologous proteins can be greatly facilitated by the availability of large amounts of purified, soluble cytokine receptors in an active state such as is provided by this invention. Alternatively, production of large amounts of ligand will be useful in screening for receptor. Markers will also be available in large amounts to generate specific reagents.

Purified protein, e.g., DIRS4, can be coated directly onto plates or otherwise presented for use in the ligand or antibody screening techniques. However, non-neutralizing antibodies to these proteins can be used as capture antibodies to immobilize the respective receptor on the solid phase, useful, e.g., in diagnostic uses.

This invention also contemplates use of, e.g., DIRS4, fragments thereof, peptides, and their fusion products in a variety of diagnostic kits and methods for detecting the presence of the protein or its ligand. Alternatively, or additionally, antibodies against the molecules may be incorporated into the kits and methods. Typically the kit will have a compartment containing either a peptide or gene segment or a reagent which recognizes one or the other. Typically, recognition reagents, in the case of peptide, would be a receptor or antibody, or in the case of a gene segment, would usually be a hybridization probe. Diagnostic applications will be useful for the markers, as described.

A preferred kit for determining the concentration of, e.g., DIRS4, in a sample would typically comprise a labeled compound, e.g., ligand or antibody, having known binding affinity for DIRS4, a source of DIRS4 (naturally occurring or recombinant) as a positive control, and a means for separating the bound from free labeled compound, for example a solid phase for immobilizing the DIRS4 in the test sample. Compartments containing reagents, and instructions, will normally be provided.

Antibodies, including antigen binding fragments, specific for mammalian claudins or schlafens or a peptide fragment, or receptor fragments are useful in diagnostic applications to detect the presence of elevated levels of protein and/or its fragments. Diagnostic assays may be homogeneous (without a separation step between free reagent and antibody-antigen complex) or heterogeneous (with a separation step). Various commercial assays exist, such as radioimmunoassay (RIA), enzyme-linked immunosorbent assay (ELISA), enzyme immunoassay (EIA), enzyme-multiplied immunoassay technique (EMIT), substrate-labeled fluorescent immunoassay (SLFIA) and the like. For example, unlabeled antibodies can be employed by using a second antibody which is labeled and which recognizes the antibody to a cytokine receptor or to a particular fragment thereof. These assays have also been extensively discussed in the literature. See, e.g., Harlow and Lane (1988) Antibodies: A Laboratory Manual, CSH., and Coligan (ed. 1991 and periodic supplements) Current Protocols In Immunology Greene/Wiley, New York.

Anti-idiotypic antibodies may have similar use to serve as agonists or antagonists of cytokine receptors or ligands. These should be useful as therapeutic reagents under appropriate circumstances.

Frequently, the reagents for diagnostic assays are supplied in kits, so as to optimize the sensitivity of the assay. For the subject invention, depending upon the nature of the assay, the protocol, and the label, either labeled or unlabeled antibody, or labeled ligand is provided. This is usually in conjunction with other additives, such as buffers, stabilizers, materials necessary for signal production such as substrates for enzymes, and the like. Preferably, the kit will also contain instructions for proper use and disposal of the contents after use. Typically the kit has compartments for each useful reagent, and will contain instructions for proper use and disposal of reagents. Desirably, the reagents are provided as a dry lyophilized powder, where the reagents may be reconstituted in an aqueous medium having appropriate concentrations for performing the assay.

The aforementioned constituents of the diagnostic assays may be used without modification or may be modified in a variety of ways. For example, labeling may be achieved by covalently or non-covalently joining a moiety which directly or indirectly provides a detectable signal. In many of these assays, a test compound, cytokine receptor, ligand, or antibodies thereto can be labeled either directly or indirectly. Possibilities for direct labeling include label groups: radiolabels such as ¹²⁵I, enzymes (U.S. Pat. No. 3,645,090) such as peroxidase and alkaline phosphatase, and fluorescent labels (U.S. Pat. No. 3,940,475) capable of monitoring the change in fluorescence intensity, wavelength shift, or fluorescence polarization. Both of the patents are incorporated herein by reference. Possibilities for indirect labeling include biotinylation of one constituent followed by binding to avidin coupled to one of the above label groups.

There are also numerous methods of separating the bound from the free ligand, or alternatively the bound from the free test compound. The cytokine receptor can be immobilized on various matrixes followed by washing. Suitable matrices include plastic such as an ELISA plate, filters, and beads. Methods of immobilizing the receptor to a matrix include, without limitation, direct adhesion to plastic, use of a capture antibody, chemical coupling, and biotin-avidin. The last step in this approach involves the precipitation of antibody/antigen complex by any of several methods including those utilizing, e.g., an organic solvent such as polyethylene glycol or a salt such as ammonium sulfate. Other suitable separation techniques include, without limitation, the fluorescein antibody magnetizable particle method described in Rattle, et al. (1984) Clin. Chem. 30(9):1457-1461, and the double antibody magnetic particle separation as described in U.S. Pat. No. 4,659,678, each of which is incorporated herein by reference.

Methods for linking protein or fragments to various labels are well reported in the literature. Many of the techniques involve the use of activated carboxyl groups either through the use of carbodiimide or active esters to form peptide bonds, the formation of thioethers by reaction of a mercapto group with an activated halogen such as chloroacetyl, or an activated olefin such as maleimide, for linkage, or the like. Fusion proteins will also find use in these applications.

Another diagnostic aspect of this invention involves use of oligonucleotide or polynucleotide sequences taken from the sequences provided. These sequences can be used as probes for detecting levels of the respective genes or transcripts in patients suspected of having an immunological or other medical disorder. The preparation of both RNA and DNA nucleotide sequences, the labeling of the sequences, and the preferred size of the sequences has received ample description and discussion in the literature. Normally an oligonucleotide probe should have at least about 14 nucleotides, usually at least about 18 nucleotides, and the polynucleotide probes may be up to several kilobases. Various labels may be employed, most commonly radionuclides, particularly ³²P. However, other techniques may also be employed, such as using biotin modified nucleotides for introduction into a polynucleotide. The biotin then serves as the site for binding to avidin or antibodies, which may be labeled with a wide variety of labels, such as radionuclides, fluorescers, enzymes, or the like. Alternatively, antibodies may be employed which can recognize specific duplexes, including DNA duplexes, RNA duplexes, DNA-RNA hybrid duplexes, or DNA-protein duplexes. The antibodies in turn may be labeled and the assay carried out where the duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound to the duplex can be detected. The use of probes to the novel anti-sense RNA may be carried out in conventional techniques such as nucleic acid hybridization, plus and minus screening, recombinational probing, hybrid released translation (HRT), and hybrid arrested translation (HART). This also includes amplification techniques such as polymerase chain reaction (PCR).

Diagnostic kits which also test for the qualitative or quantitative presence of other markers are also contemplated. Diagnosis or prognosis may depend on the combination of multiple indications used as markers. Thus, kits may test for combinations of markers. See, e.g., Viallet, et al. (1989) Progress in Growth Factor Res. 1:89-97.

VIII. Therapeutic Utility

This invention provides reagents with significant therapeutic value. See, e.g., Levitzki (1996) Curr. Opin. Cell Biol. 8:239-244. The cytokine receptors (naturally occurring or recombinant), fragments thereof, mutein receptors, and antibodies, along with compounds identified as having binding affinity to the receptors or antibodies, should be useful in the treatment of conditions exhibiting abnormal expression of the receptors of their ligands. Such abnormality will typically be manifested by immunological or other disorders. Additionally, this invention should provide therapeutic value in various diseases or disorders associated with abnormal expression or abnormal triggering of response to the ligand. The biology of interferons, IL-10, TNFs, and TGFs are well described. Conversely, the TLRs have also been the subject of much interest, and the described homologs described herein will also be of similar interest. Associations with significant medical conditions for the claudins and schlafens is described below.

Recombinant proteins, muteins, agonist or antagonist antibodies thereto, or antibodies can be purified and then administered to a patient. These reagents can be combined for therapeutic use with additional active ingredients, e.g., in conventional pharmaceutically acceptable carriers or diluents, along with physiologically innocuous stabilizers and excipients. These combinations can be sterile, e.g., filtered, and placed into dosage forms as by lyophilization in dosage vials or storage in stabilized aqueous preparations. This invention also contemplates use of antibodies or binding fragments thereof which are not complement binding.

Ligand screening using receptor or fragments thereof can be performed to identify molecules having binding affinity to the receptors. Subsequent biological assays can then be utilized to determine if a putative ligand can provide competitive binding, which can block intrinsic stimulating activity. Receptor fragments can be used as a blocker or antagonist in that it blocks the activity of ligand. Likewise, a compound having intrinsic stimulating activity can activate the receptor and is thus an agonist in that it simulates the activity of ligand, e.g., inducing signaling. This invention further contemplates the therapeutic use of antibodies to cytokine receptors as antagonists.

Conversely, receptor screening for receptors for ligands can be performed. However, ligands can also be screened for function using biological assays, which are typically simple due to the soluble nature of the ligands.

The quantities of reagents necessary for effective therapy will depend upon many different factors, including means of administration, target site, reagent physiological life, pharmacological life, physiological state of the patient, and other medicants administered. Thus, treatment dosages should be titrated to optimize safety and efficacy. Typically, dosages used in vitro may provide useful guidance in the amounts useful for in situ administration of these reagents. Animal testing of effective doses for treatment of particular disorders will provide further predictive indication of human dosage. Various considerations are described, e.g., in Gilman, et al. (eds. 1990) Goodman and Gilman's: The Pharmacological Bases of Therapeutics, 8th Ed., Pergamon Press; and Remington's Pharmaceutical Sciences, 17th ed. (1990), Mack Publishing Co., Easton, Pa.; each of which is hereby incorporated herein by reference. Methods for administration are discussed therein and below, e.g., for oral, intravenous, intraperitoneal, or intramuscular administration, transdermal diffusion, and others. Pharmaceutically acceptable carriers will include water, saline, buffers, and other compounds described, e.g., in the Merck Index, Merck & Co., Rahway, N.J. Dosage ranges would ordinarily be expected to be in amounts lower than 1 mM concentrations, typically less than about 10 μM concentrations, usually less than about 100 nM, preferably less than about 10 pM (picomolar), and most preferably less than about 1 fM (femtomolar), with an appropriate carrier. Slow release formulations, or slow release apparatus will often be utilized for continuous administration.

Cytokines, receptors, fragments thereof, and antibodies or its fragments, antagonists, and agonists, may be administered directly to the host to be treated or, depending on the size of the compounds, it may be desirable to conjugate them to carrier proteins such as ovalbumin or serum albumin prior to their administration. Therapeutic formulations may be administered in many conventional dosage formulations. While it is possible for the active ingredient to be administered alone, it is preferable to present it as a pharmaceutical formulation. Formulations comprise at least one active ingredient, as defined above, together with one or more acceptable carriers thereof. Each carrier must be both pharmaceutically and physiologically acceptable in the sense of being compatible with the other ingredients and not injurious to the patient. Formulations include those suitable for oral, rectal, nasal, or parenteral (including subcutaneous, intramuscular, intravenous and intradermal) administration. The formulations may conveniently be presented in unit dosage form and may be prepared by methods well known in the art of pharmacy. See, e.g., Gilman, et al. (eds. 1990) Goodman and Gilman's: The Pharmacological Bases of Therapeutics, 8th Ed., Pergamon Press; and Remington's Pharmaceutical Sciences, 17th ed. (1990), Mack Publishing Co., Easton, Pa.; Avis, et al. (eds. 1993) Pharmaceutical Dosage Forms: Parenteral Medications Dekker, N.Y.; Lieberman, et al. (eds. 1990) Pharmaceutical Dosage Forms: Tablets Dekker, N.Y.; and Lieberman, et al. (eds. 1990) Pharmaceutical Dosage Forms: Disperse Systems Dekker, N.Y. The therapy of this invention may be combined with or used in association with other therapeutic agents, e.g., agonists or antagonists of other cytokine receptor family members.

IX. Screening

Drug screening using DIRS4, TLR-L receptors, or fragments thereof can be performed to identify compounds having binding affinity to the receptor subunits, including isolation of associated components. See, e.g., Emory and Schlegel (1996) Cost-Effective Strategies for Automated and Accelerated High-Throughput Screening IBC, Inc., Southborough, Mass. Subsequent biological assays can then be utilized to determine if the compound has intrinsic stimulating activity and is therefore a blocker or antagonist in that it blocks the activity of the ligand. Likewise, a compound having intrinsic stimulating activity can activate the receptor and is thus an agonist in that it simulates the activity of a cytokine ligand. This invention further contemplates the therapeutic use of antibodies to the receptor as cytokine agonists or antagonists.

Conversely, for ligands, receptors may be screened. Orphan receptor subunits, or testing of known receptor subunits in known or novel pairings may be performed.

One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant DNA molecules expressing the DIRS4 or TLR-L receptors. Cells may be isolated which express a receptor in isolation from other functional receptors, or in combination with other specific subunits. Such cells, either in viable or fixed form, can be used for standard ligand/receptor binding assays. See also, Parce, et al. (1989) Science 246:243-247; and Owicki, et al. (1990) Proc. Nat'l Acad. Sci. USA 87:4007-4011, which describe sensitive methods to detect cellular responses. Competitive assays are particularly useful, where the cells (source of putative ligand) are contacted and incubated with a labeled receptor or antibody having known binding affinity to the ligand, such as ¹²⁵I-antibody, and a test sample whose binding affinity to the binding composition is being measured. The bound and free labeled binding compositions are then separated to assess the degree of ligand binding. The amount of test compound bound is inversely proportional to the amount of labeled receptor binding to the known source. Any one of numerous techniques can be used to separate bound from free ligand to assess the degree of ligand binding. This separation step could typically involve a procedure such as adhesion to filters followed by washing, adhesion to plastic followed by washing, or centrifugation of the cell membranes. Viable cells could also be used to screen for the effects of drugs on cytokine mediated functions, e.g., second messenger levels, i.e., Ca⁺⁺; cell proliferation; inositol phosphate pool changes; and others. Some detection methods allow for elimination of a separation step, e.g., a proximity sensitive detection system. Calcium sensitive dyes will be useful for detecting Ca⁺⁺ levels, with a fluorimeter or a fluorescence cell sorting apparatus.

X. Ligands

The descriptions of the DIRS4 and TLR-L receptors herein provide means to identify ligands, as described above. Such ligand should bind specifically to the respective receptor with reasonably high affinity. Various constructs are made available which allow either labeling of the receptor to detect its ligand. For example, directly labeling cytokine receptor, fusing onto it markers for secondary labeling, e.g., FLAG or other epitope tags, etc., will allow detection of receptor. This can be histological, as an affinity method for biochemical purification, or labeling or selection in an expression cloning approach. A two-hybrid selection system may also be applied making appropriate constructs with the available cytokine receptor sequences. See, e.g., Fields and Song (1989) Nature 340:245-246.

Generally, descriptions of cytokine receptors will be analogously applicable to individual specific embodiments directed to DIRS4 or TLR-L reagents and compositions. Conversely, soluble ligands, e.g., TNFs and TGFs, will be characterized for biological activity.

The broad scope of this invention is best understood with reference to the following examples, which are not intended to limit the inventions to the specific embodiments.

EXAMPLES

I. General Methods

Some of the standard methods are described or referenced, e.g., in Maniatis, et al. (1982) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor Press; Sambrook, et al. (1989) Molecular Cloning: A Laboratory Manual, (2d ed.), vols. 1-3, CSH Press, NY; Ausubel, et al., Biology, Greene Publishing Associates, Brooklyn, N.Y.; or Ausubel, et al. (1987 and Supplements) Current Protocols in Molecular Biology, Greene/Wiley, New York. Methods for protein purification include such methods as ammonium sulfate precipitation, column chromatography, electrophoresis, centrifugation, crystallization, and others. See, e.g., Ausubel, et al. (1987 and periodic supplements); Coligan, et al. (ed. 1996) and periodic supplements, Current Protocols In Protein Science Greene/Wiley, New York; Deutscher (1990) “Guide to Protein Purification” in Methods in Enzymology, vol. 182, and other volumes in this series; and manufacturer's literature on use of protein purification products, e.g., Pharmacia, Piscataway, N.J., or Bio-Rad, Richmond, Calif. Combination with recombinant techniques allow fusion to appropriate segments, e.g., to a FLAG sequence or an equivalent which can be fused via a protease-removable sequence. See, e.g., Hochuli (1989) Chemische Industrie 12:69-70; Hochuli (1990) “Purification of Recombinant Proteins with Metal Chelate Absorbent” in Setlow (ed.) Genetic Engineering, Principle and Methods 12:87-98, Plenum Press, N.Y.; and Crowe, et al. (1992) QIAexpress: The High Level Expression & Protein Purification System QUIAGEN, Inc., Chatsworth, Calif.

Computer sequence analysis is performed, e.g., using available software programs, including those from the GCG (U. Wisconsin) and GenBank sources. Public sequence databases were also used, e.g., from GenBank and others.

Many techniques applicable to IL-10 or IL-12 receptors may be applied to the DIRS4 or other receptor subunits, as described, e.g., in U.S. Ser. No. 08/110,683 (IL-10 receptor), which is incorporated herein by reference.

II. Computational Analysis

Human sequences were identified from genomic sequence database using, e.g., the BLAST server (Altschul, et al. (1994) Nature Genet. 6:119-129). Standard analysis programs may be used to evaluate structure, e.g., PHD (Rost and Sander (1994) Proteins 19:55-72) and DSC (King and Sternberg (1996) Protein Sci. 5:2298-2310). Standard comparison software includes, e.g., Altschul, et al. (1990) J. Mol. Biol. 215:403-10; Waterman (1995) Introduction to Computational Biology: Maps, Sequences, and Genomes Chapman & Hall; Lander and Waterman (eds. 1995) Calculating the Secrets of Life: Applications of the Mathematical Sciences in Molecular Biology National Academy Press; and Speed and Waterman (eds. 1996) Genetic Mapping and DNA Sequencing (Ima Volumes in Mathematics and Its Applications, Vol 81) Springer Verlag.

III. Cloning of Full-Length cDNAs; Chromosomal Localization

PCR primers derived from the sequences are used to probe a human cDNA library. Full length cDNAs for primate, rodent, or other species DIRS4 are cloned, e.g., by DNA hybridization screening of _gt10 phage. PCR reactions are conducted using T. aquaticus Taqplus DNA polymerase (Stratagene) under appropriate conditions.

Chromosome spreads are prepared. In situ hybridization is performed on chromosome preparations obtained from phytohemagglutinin-stimulated human lymphocytes cultured for 72 h. 5-bromodeoxyuridine was added for the final seven hours of culture (60 _g/ml of medium), to ensure a posthybridization chromosomal banding of good quality.

A PCR fragment, amplified with the help of primers, is cloned into an appropriate vector. The vector is labeled by nick-translation with ³H. The radiolabeled probe is hybridized to metaphase spreads at final concentration of 200 ng/ml of hybridization solution as described in Maffei, et al. (1985) Hum. Genet. 69:327-331.

After coating with nuclear track emulsion (KODAK NTB₂), slides are exposed. To avoid any slipping of silver grains during the banding procedure, chromosome spreads are first stained with buffered Giemsa solution and metaphase photographed. R-banding is then performed by the fluorochrome-photolysis-Giemsa (FPG) method and metaphases rephotographed before analysis. Alternatively, mapped sequence tags may be searched in a database.

Similar appropriate methods are used for other species.

IV. Localization of mRNA

Human multiple tissue (Cat # 1, 2) and cancer cell line blots (Cat # 7757-1), containing approximately 2 μg of poly(A)⁺ RNA per lane, are purchased from Clontech (Palo Alto, Calif.). Probes are radiolabeled with [α-³²P] dATP, e.g., using the Amersham Rediprime random primer labeling kit (RPN1633). Prehybridization and hybridizations are performed at 65° C. in 0.5 M Na₂HPO₄, 7% SDS, 0.5 M EDTA (pH 8.0). High stringency washes are conducted, e.g., at 65° C. with two initial washes in 2×SSC, 0.1% SDS for 40 min followed by a subsequent wash in 0.1×SSC, 0.1% SDS for 20 min. Membranes are then exposed at −70° C. to X-Ray film (Kodak) in the presence of intensifying screens. More detailed studies by cDNA library Southerns are performed with selected human DIRS4 clones to examine their expression in hemopoietic or other cell subsets.

Alternatively, two appropriate primers are selected, e.g., from the tables. RT-PCR is used on an appropriate mRNA sample selected for the presence of message to produce a cDNA, e.g., a sample which expresses the gene.

Full length clones may be isolated by hybridization of cDNA libraries from appropriate tissues pre-selected by PCR signal. Northern blots can be performed.

Message for genes encoding each gene will be assayed by appropriate technology, e.g., PCR, immunoassay, hybridization, or otherwise. Tissue and organ cDNA preparations are available, e.g., from Clontech, Mountain View, Calif. Identification of sources of natural expression are useful, as described. And the identification of functional receptor subunit pairings will allow for prediction of what cells express the combination of receptor subunits which will result in a physiological responsiveness to each of the cytokine ligands.

For mouse distribution, e.g., Southern Analysis can be performed: DNA (5 μg) from a primary amplified cDNA library was digested with appropriate restriction enzymes to release the inserts, run on a 1% agarose gel and transferred to a nylon membrane (Schleicher and Schuell, Keene, N.H.).

Samples for mouse mRNA isolation may include: resting mouse fibroblastic L cell line (C200); Braf:ER (Braf fusion to estrogen receptor) transfected cells, control (C201); T cells, TH1 polarized (Me₁₁₄ bright, CD4+ cells from spleen, polarized for 7 days with IFN-γ and anti IL-4; T200); T cells, TH2 polarized (Mel14 bright, CD4+ cells from spleen, polarized for 7 days with IL-4 and anti-IFN-γ; T201); T cells, highly TH1 polarized (see Openshaw, et al. (1995) J. Exp. Med. 182:1357-1367; activated with anti-CD3 for 2, 6, 16 h pooled; T202); T cells, highly TH2 polarized (see Openshaw, et al. (1995) J. Exp. Med. 182:1357-1367; activated with anti-CD3 for 2, 6, 16 h pooled; T203); CD44− CD25+ pre T cells, sorted from thymus (T204); TH1 T cell clone D1.1, resting for 3 weeks after last stimulation with antigen (T205); TH1 T cell clone D1.1, 10 μg/ml ConA stimulated 15 h (T206); TH2 T cell clone CDC35, resting for 3 weeks after last stimulation with antigen (T207); TH2 T cell clone CDC35, 10 μg/ml ConA stimulated 15 h (T208); Mel14+ naive T cells from spleen, resting (T209); Mel14+ T cells, polarized to Th1 with IFN-γ/IL-12/anti-IL-4 for 6, 12, 24 h pooled (T210); Mel14+ T cells, polarized to Th2 with IL-4/anti-IFN-γ for 6, 13, 24 h pooled (T211); unstimulated mature B cell leukemia cell line A20 (B200); unstimulated B cell line CH12 (B201); unstimulated large B cells from spleen (B202); B cells from total spleen, LPS activated (B203); metrizamide enriched dendritic cells from spleen, resting (D200); dendritic cells from bone marrow, resting (D201); monocyte cell line RAW 264.7 activated with LPS 4 h (M200); bone-marrow macrophages derived with GM and M-CSF (M201); macrophage cell line J774, resting (M202); macrophage cell line J774+LPS+anti-IL-10 at 0.5, 1, 3, 6, 12 h pooled (M203); macrophage cell line J774+LPS+IL-10 at 0.5, 1, 3, 5, 12 h pooled (M204); aerosol challenged mouse lung tissue, Th2 primers, aerosol OVA challenge 7, 14, 23 h pooled (see Garlisi, et al. (1995) Clinical Immunology and Immunopathology 75:75-83; X206); Nippostrongulus-infected lung tissue (see Coffman, et al. (1989) Science 245:308-310; X200); total adult lung, normal (O200); total lung, rag-1 (see Schwarz, et al. (1993) Immunodeficiency 4:249-252; 0205); IL-10 K.O. spleen (see Kuhn, et al. (1991) Cell 75:263-274; X201); total adult spleen, normal (O201); total spleen, rag-1 (O207); IL-10 K.O. Peyer's patches (O202); total Peyer's patches, normal (O210); IL-10 K.O. mesenteric lymph nodes (X203); total mesenteric lymph nodes, normal (O211); IL-10 K.O. colon (X203); total colon, normal (O212); NOD mouse pancreas (see Makino, et al. (1980) Jikken Dobutsu 29:1-13; X205); total thymus, rag-1 (O208); total kidney, rag-1 (O209); total heart, rag-1 (O202); total brain, rag-1 (O203); total testes, rag-1 (O204); total liver, rag-1 (O206); rat normal joint tissue (O300); and rat arthritic joint tissue (X300).

Samples for human mRNA isolation may include: peripheral blood mononuclear cells (monocytes, T cells, NK cells, granulocytes, B cells), resting (T100); peripheral blood mononuclear cells, activated with anti-CD3 for 2, 6, 12 h pooled (T101); T cell, THO clone Mot 72, resting (T102); T cell, THO clone Mot 72, activated with anti-CD28 and anti-CD3 for 3, 6, 12 h pooled (T103); T cell, THO clone Mot 72, anergic treated with specific peptide for 2, 7, 12 h pooled (T104); T cell, TH1 clone HY06, resting (T107); T cell, TH1 clone HY06, activated with anti-CD28 and anti-CD3 for 3, 6, 12 h pooled (T108); T cell, TH1 clone HY06, anergic treated with specific peptide for 2, 6, 12 h pooled (T109); T cell, TH2 clone HY935, resting (T110); T cell, TH2 clone HY935, activated with anti-CD28 and anti-CD3 for 2, 7, 12 h pooled (T111); T cells CD4+CD45RO− T cells polarized 27 days in anti-CD28, IL-4, and anti IFN-γ, TH2 polarized, activated with anti-CD3 and anti-CD28 4 h (T116); T cell tumor lines Jurkat and Hut78, resting (T117); T cell clones, pooled AD130.2, Tc783.12, Tc783.13, Tc783.58, Tc782.69, resting (T118); T cell random γδ T cell clones, resting (T119); Splenocytes, resting (B100); Splenocytes, activated with anti-CD40 and IL-4 (B101); B cell EBV lines pooled WT49, RSB, JY, CVIR, 721.221, RM3, HSY, resting (B 102); B cell line JY, activated with PMA and ionomycin for 1, 6 h pooled (B 103); NK 20 clones pooled, resting (K100); NK 20 clones pooled, activated with PMA and ionomycin for 6 h (K101); NKL clone, derived from peripheral blood of LGL leukemia patient, IL-2 treated (K106); NK cytotoxic clone 640-A30-1, resting (K107); hematopoietic precursor line TF1, activated with PMA and ionomycin for 1, 6 h pooled (C100); U937 premonocytic line, resting (M100); U937 premonocytic line, activated with PMA and ionomycin for 1, 6 h pooled (M101); elutriated monocytes, activated with LPS, IFNγ, anti-IL-10 for 1, 2, 6, 12, 24 h pooled (M102); elutriated monocytes, activated with LPS, IFNγ, IL-10 for 1, 2, 6, 12, 24 h pooled (M103); elutriated monocytes, activated with LPS, IFNγ, anti-IL-10 for 4, 16 h pooled (M106); elutriated monocytes, activated with LPS, IFNγ, IL-10 for 4, 16 h pooled (M107); elutriated monocytes, activated LPS for 1 h (M108); elutriated monocytes, activated LPS for 6 h (M109); DC 70% CD1a+, from CD34+ GM-CSF, TNFα 12 days, resting (D101); DC 70% CD1a+, from CD34+ GM-CSF, TNFα 12 days, activated with PMA and ionomycin for 1 hr (D102); DC 70% CD1a+, from CD34+ GM-CSF, TNFα 12 days, activated with PMA and ionomycin for 6 hr (D103); DC 95% CD1a+, from CD34+ GM-CSF, TNF_(—) 12 days FACS sorted, activated with PMA and ionomycin for 1, 6 h pooled (D104); DC 95% CD14+, ex CD34+ GM-CSF, TNFα 12 days FACS sorted, activated with PMA and ionomycin 1, 6 hr pooled (D105); DC CD1a+ CD86+, from CD34+ GM-CSF, TNFα 12 days FACS sorted, activated with PMA and ionomycin for 1, 6 h pooled (D106); DC from monocytes GM-CSF, IL-4 5 days, resting (D107); DC from monocytes GM-CSF, IL-4 5 days, resting (D108); DC from monocytes GM-CSF, IL-4 5 days, activated LPS 4, 16 h pooled (D109); DC from monocytes GM-CSF, IL-4 5 days, activated TNFα, monocyte supe for 4, 16 h pooled (D110); leiomyoma L11 benign tumor (X101); normal myometrium M5 (O115); malignant leiomyosarcoma GS1 (X103); lung fibroblast sarcoma line MRC5, activated with PMA and ionomycin for 1, 6 h pooled (C101); kidney epithelial carcinoma cell line CHA, activated with PMA and ionomycin for 1, 6 h pooled (C102); kidney fetal 28 wk male (O100); lung fetal 28 wk male (O101); liver fetal 28 wk male (O102); heart fetal 28 wk male (O103); brain fetal 28 wk male (O104); gallbladder fetal 28 wk male (O106); small intestine fetal 28 wk male (O107); adipose tissue fetal 28 wk male (O108); ovary fetal 25 wk female (O109); uterus fetal 25 wk female (O110); testes fetal 28 wk male (O111); spleen fetal 28 wk male (O112); adult placenta 28 wk (O113); and tonsil inflamed, from 12 year old (X100).

For the DIRS4, southern blot analysis revealed expression in several cDNA libraries, including resting MOT72 (Th0 clone); resting, activated, and anti-peptide HY06 (Th1 clone); activated T cells CD4+, Th2 polarized; resting pooled T cell clones; resting and activated splenocytes; resting EBV B cells; activated JY (B cell line); cytotoxic NK cells; TF1 cells; resting and activated U937 cells; monocytes treated with anti-IL-10; monocytes (anti-IL-10 and IL-10 stimulated); activated monocytes; dendritic cells (activated and resting); MRC5 (lung fibroblast sarcoma line); CHA (kidney epithelial carcinoma line); normal and asthmatic monkey lung; normal and smoker lung; normal colon; fetal lung; liver; gall bladder; and small intestine. There were two transcript sizes, about 500 bp and about 1.8 kb bands, suggesting two different transcripts, possibly soluble and membrane spanning forms.

The primate, e.g., human, TNFx expression, by PCR, is high in allergic lung and normal lung; much lower in adult placenta, fetal spleen, and normal skin. Essentially no expression in gut samples and fetal organs. In cells, high expression was detected in resting HY06 cells and TF-1; lower in activated HY06 cell and JY cells, and no significant expression in the other human samples tested, e.g., most in the list above. Table 1 shows additional TaqMan expression data for human TNFx.

TABLE 1 LIBRARY Ct_gene PBMC resting 44.64 PBMC activated 40.48 Mot 72 resting 26.29 Mot 72 activated 24.51 Mot 72 anti-peptide 20.72 HY06 resting 15.86 HY06 activated 18.3 HY06 anti-peptide 24.27 HY935 resting 25.97 HY935 activated 25.03 B21 resting 26.3 B21 activated 24.53 Tc gamma delta 45 Jurkat resting pSPORT 45 Jurkat activated pSPORT 28.09 Splenocytes resting 23.51 Splenocytes activated 26.19 Bc 23.88 JY 19.29 NK pool 38.21 NK pool activated 37.54 NKA6 pSPORT 34.39 NKL/IL-2 25.71 NK cytotox. 23.28 NK non cytotox. 26.35 U937/CD004 resting 28.18 U937 activated 26.21 C− 27 C+ 23.13 mast cell pME 28.65 TC1080 CD28-pMET7 38.1 RV-C30 TR1 pMET7 24.97 DC resting mono-derived 28.12 DC CD40L activ. mono-deriv. 27.07 DC resting CD34-derived 28.9 DC TNF/TGFb act CD34-der. 36.74 allergic lung #19 20.21 Pneumocystis carnii lung #20 36.33 RA synovium pool 28 Psoriasis skin 32.37 normal lung 35.68 4 hr. Ascaris lung 31.45 24 hr. Ascaris lung 26.34 normal lung pool 22.21 Taq_control_genomic_1 50 mono + anti-IL-10 22.47 mono + IL-10 21.04 M1 40.52 M6 21.75 70% DC resting 26.27 D1 37.94 D6 25.05 CD1a+ 95% 26.87 CD14+ 95% 35.17 CD1a+ CD86+ 27.48 DC/GM/IL-4 32.33 DC LPS 27.81 DC mix 27.32 fetal kidney 26.41 fetal lung 31.16 fetal liver 26.28 fetal heart 34.28 fetal brain 25.02 fetal small intestine 37.89 fetal adipose tissue 26.41 fetal ovary 37.49 fetal uterus 26.03 fetal testes 36.65 fetal spleen 23.2 adult placenta 24.06 inflammed tonsil 26.21 TF1 23.48 MRC5 33.99 CHA 28.27 Taq_control_genomic_2 50 Crohns colon 403242A 28.32 lung 080698-2 27.42 18 hr. Ascaris lung 28.06 hi dose IL-4 lung 34.01 normal colon #22 44.6 ulcerative colitis colon #26 38.12 normal thyroid 28.14 Hashimotos thyroiditis 36.88 normal skin 24.12 Crohns colon 4003197A 30.31 lung 121897-1 36.25 Crohns colon 9609C144 27.49 A549 unstim. 28.03 A549 activated 24.1 Taq_control_water 50

The rodent, e.g., mouse, TNFx is highly expressed in 5 month ApoE KO mouse aorta; C57B6 3 wk polarized Th1 cells; and C57B6 3 wk polarized Th2 cells. It is less highly expressed in Balb/c 3 wk polarized Th2 cells, LPS treated spleen, and various other Th2 polarized populations. In tissues, by PCR, it is expressed highly in TNK KO spleen, NZB/W spleen, NZB/W kidney, NZB/W spleen, GF ears/skin; rag-1 testis, w.t. C57B6 spleen, w.t. C57B6 pancreas, and 2 mo. lung. It is expressed at lower levels in influenza lung, rag-1 lung, rag-1 spleen, spinal cord samples, lung samples, stomach, and lymph nodes. Table 2 shows additional TaqMan expression data for mouse TNFx.

TABLE 2 LIBRARY Ct_gene L cell 26 TH1 7 day 26.63 TH2 7 day 24.56 TH1 3 week Balb/C 39.09 TH2 3 week Balb/C 24.48 preT 36.92 D1.1 resting 32.74 D1.1 con A stim. 37.76 CDC35 resting 30.8 CDC35 con A stim. 41.92 Mel 14+ naive T 28.16 Mel 14+ TH1 29.2 Mel 14+ TH2 25.02 A20 37.61 CH12 25.29 lg. B cell 30.34 LPS spleen 24.04 macrophage 28.6 J774 resting 39.73 J774 + LPS + anti-IL-10 36.51 J774 + LPS + IL-10 40.53 Nippo-infected lung 25.87 IL-10 K.O. spleen 24.18 IL-10 K.O. colon 36.97 asthmatic lung 26.61 w.t. lung 24.06 w.t. spleen 28.87 rag-1 heart 26.48 Nippo IL-4 K.O. lung 28.59 Nippo anti IL-5 lung 25.73 Influenza lung 23.93 b common lung 2 month 24.53 IL-10 K.O. stomach 29.87 IL-10 K.O. MLN aIL-12 26.58 IL-10 K.O. MLN + IL-10 25.89 Rag-2 Hh− colon 29.2 Rag-2 Hh+ colon 27.1 IL-7 K.O./Rag-2 Hh− colon 40 IL-7 K.O./Rag-2 Hh+ colon 40 transfer model IBD 28.1 w.t. C57 Bl/6 aorta 39.38 w.t. thymus 27.05 w.t. stomach 26.49 MM2/MM3 resting pME 37.62 rag-1 brain 24.47 rag-1 testes 38.4 rag-1 lung 22.81 rag-1 liver 36.69 rag-1 spleen 24.23 rag-1 thymus 23.91 rag-1 kidney 22.32 w.t. Peyers patches 25.48 w.t. mesenteric lymph nodes 25.59 w.t. colon 28.7 Braf: ER (−) oligo dT 38.53 TH1 3 week C57 Bl/6 23.12 TH2 3 week C57 Bl/6 22.54 TH1 3 week Balb/C fresh 28.02 TH2 3 week Balb/C fresh 37.73 b.m. DC (YJL) resting 27.99 b.m. DC (YJL) aCD40 stim. 40.47 b.m. mf + LPS + aIL-10R 29.74 b.m. mf + LPS + IL-10 27.67 peritoneal mf 37.02 MC-9/MCP-12 pMET7 39.68 EC 40.13 EC + TNFa 40.54 bEnd3 + TNFa 41.26 bEnd3 + TNFa + IL-10 38.35 ApoE aorta 5 month 21.03 ApoE aorta 12 month 34.28 NZ B/W kidney 21.02 NZ B/W spleen 21.2 tolerized & challenged lung 27.17 Aspergillus lung 23.32 Taq_control_water 50 Taq_control_genomic_1 50 Taq_control_genomic_2 50 w.t. d17 spinal cord EAE model 22.87 TNF K.O. d17 spinal cord EAE model 22.84 TNF K.O. spinal cord 23.27 TNF K.O. spleen 20.78 G.F. ears (skin) 20.7 w.t. spinal cord 22.74 w.t. C57 Bl/6 spleen 22.15 w.t. C57 Bl/6 pancreas 24.75 MM2/MM3 activated. pME 37.67

The primate, e.g., human, TNFy is expressed in fetal adipose tissue and fetal ovary. It is expressed at a lower level in fetal brain, Hashimoto's thyroiditis, RA synovium pool, adult placenta, and fetal uterus. It is expressed at lower levels in fetal kidney, normal thyroid, and detectable in Crohn's colon, psoriasis skin, and fetal lung. It is essentially undetectable in other organs evaluated, including various Ascaris challenged lung samples. In cell libraries, it is expressed in TF-1 cells, and much lower in CHA cells, and was not significantly expressed in other cell lines tested. Table 3 provides additional TaqMan expression data for human TNFy.

TABLE 3 LIBRARY Ct_gene PBMC resting 45 PBMC activated 44.16 Mot 72 resting 42.47 Mot 72 activated 28.59 Mot 72 anti-peptide 42.47 HY06 resting 43.19 HY06 activated 41.48 HY06 anti-peptide 43.28 HY935 resting 45 HY935 activated 43.62 B21 resting 41.73 B21 activated 44.35 Tc gamma delta 43.21 Jurkat resting pSPORT 23.44 Jurkat activated pSPORT 25.19 Splenocytes resting 38.72 Splenocytes activated 44.09 Bc 44.83 JY 43.05 NK pool 39.09 NK pool activated 44.32 NKA6 pSPORT 42.8 NKL/IL-2 45 NK cytotox. 44.79 NK non cytotox. 45 U937/CD004 resting 24.17 U937 activated 24.41 C− 40.38 C+ 41.17 mono + anti-IL-10 45 DC resting mono-derived 45 DC CD40L activ. mono-deriv. 45 DC resting CD34− derived 45 DC TNF/TGFb act CD34-der. 39.71 allergic lung #19 43.22 Pneumocystis carnii lung #20 43.81 normal colon #22 43.66 ulcerative colitis colon #26 45 normal thyroid 27.71 Hashimotos thyroiditis 27.4 RA synovium pool 28 Psoriasis skin 31.49 normal lung 45 Crohns colon 403242A 33.18 lung 080698-2 30.01 mono + IL-10 42.96 M1 41.25 M6 45 70% DC resting 40.37 D1 28.94 D6 28.38 CD1a+ 95% 25.63 CD14+ 95% 28.36 CD1a+ CD86+ 28.67 DC/GM/IL-4 45 DC LPS 38.8 DC mix 26.53 fetal kidney 27.98 fetal lung 30.57 fetal liver 43.92 fetal heart 40.84 fetal brain 26.02 fetal small intestine 40.05 fetal adipose tissue 23.63 fetal ovary 25.85 fetal uterus 27.57 fetal testes 45 fetal spleen 39.08 adult placenta 28.05 inflammed tonsil 45 TF1 22.09 MRC5 26.18 CHA 19.22 mast cell pME 43.93 TC1080 CD28-pMET7 41.62 RV-C30 TR1 pMET7 42.76 4 hr. Ascaris lung 45 24 hr. Ascaris lung 45 normal lung pool 45 normal skin 42.69 Crohns colon 4003197A 29.82 lung 121897-1 45 Crohns colon 9609C144 41.86 A549 unstim. 27.09 A549 activated 29.01 Taq_control_water 50 Taq_control_genomic_1 50 Taq_control_genomic_2 50 18 hr. Ascaris lung 44.16 hi dose IL-4 lung 43.59 Table 4 provides TaqMan expression data for rodent, e.g., moust TNFy.

LIBRARY Ct_gene L cell 40 TH1 7 day 40 TH2 7 day 27.11 TH1 3 week Balb/C 40 TH2 3 week Balb/C 26.95 preT 40 D1.1 resting 40 D1.1 con A stim. 40 CDC35 resting 40 CDC35 con A stim. 39.83 Mel 14+ naive T 40 Mel14+ TH1 40 Mel 14+ TH2 31.22 A20 27.39 CH12 28.18 lg. B cell 26.35 LPS spleen 21.58 macrophage 40 J774 resting 24.99 J774 + LPS + anti-IL-10 28.41 J774 + LPS + IL-10 27.57 Nippo-infected lung 26.98 IL-10 K.O. spleen 25.43 IL-10 K.O. colon 23.68 asthmatic lung 37.45 w.t. lung 40 w.t. spleen 39.95 rag-1 heart 40 rag-1 brain 40 rag-1 testes 40 Influenza lung 37.13 b common lung 2 month 39.33 IL-10 K.O. stomach 27.3 IL-10 K.O. MLN aIL-12 40 IL-10 K.O. MLN + IL-10 37.97 Rag-2 Hh− colon 26.95 Rag-2 Hh+ colon 22.94 IL-7 K.O./Rag-2 Hh− colon 26.77 IL-7 K.O./Rag-2 Hh+ colon 24.24 transfer model IBD 23.01 w.t. C57 Bl/6 aorta 40 w.t. spinal cord 38.8 w.t. C57 Bl/6 spleen 26.38 w.t. C57 Bl/6 pancreas 40 rag-1 lung 40 rag-1 liver 40 rag-1 spleen 23.97 rag-1 thymus 26.29 rag-1 kidney 40 w.t. Peyers patches 27.04 w.t. mesenteric lymph nodes 40 w.t. colon 26.63 Braf: ER (−) oligo dT 40 TH1 3 week C57 Bl/6 26.78 TH2 3 week C57 Bl/6 40 TH1 3 week Balb/C fresh 40 TH2 3 week Balb/C fresh 40 b.m. DC (YJL) resting 40 b.m. DC (YJL) aCD40 stim. 40 b.m. mf + LPS + aIL-10R 40 b.m. mf + LPS + IL-10 40 peritoneal mf 40 MC-9/MCP-12 pMET7 40 EC 40 EC + TNFa 40 bEnd3 + TNFa 40 bEnd3 + TNFa + IL-10 40 ApoE aorta 5 month 35.16 ApoE aorta 12 month 35.47 NZ B/W kidney 37.17 NZ B/W spleen 25.25 tolerized & challenged lung 40 Aspergillus lung 39.26 Nippo IL-4 K.O. lung 26.13 Nippo anti IL-5 lung 34.73 w.t. thymus 40 w.t. stomach 30.14 MM2/MM3 resting pME 40 MM2/MM3 activated. pME 40 Taq_control_water 50 Taq_control_genomic_1 50 Taq_control_genomic_2 50 w.t. d17 spinal cord EAE model 40 TNF K.O. d17 spinal cord EAE model 40 TNF K.O. spinal cord 27.99 TNF K.O. spleen 24.93 G.F. ears (skin) 40

The primate, e.g., human, TLR-L1 is expressed in TF-1 cells, D6 cells, and barely detectable in resting U937 cells, resting Jurkat cells, and pooled NK cells. In tissues, it is found in fetal uterus, fetal ovary, allergic lung, and fetal testis. Lower levels are found in fetal kidney, fetal small intestine, fetal brain, fetal adipose tissue, normal lung pool, and fetal lung.

The primate, e.g., human, TLR-L2, TLR-L3, and TLR-L4 seem to be expressed in brain tissue.

The primate, e.g., human, TLR-L5 seems to be expressed in unstimulated A549, activated A549, MRC5, and Bc cell lines. Among tissues, it is most highly expressed in fetal uterus, fetal small intestine, and lesser in fetal lung, fetal kidney, fetal liver, and fetal ovary. It is just detectable in fetal brain, fetal adipose, fetal testes, psoriasis skin, and various intestinal samples.

The 5685C6 probes show positive hybridization to subtraction libraries of Th2 minus Th1 polarized cells, and absence of hybridization to libraries of Th1 minus Th2 polarized cells. This suggests that the probe is present selectively in Th2 polarized cells, and can serve as a marker for such cell type. PCR techniques should confirm the expression profile.

Structurally, this protein exhibits similarities to other proteins possessing a thioredoxin fold, including a peroxidase protein, e.g., glutathione peroxidase. See Choi, et al. (1998) Nature Structural Biol. 5:400-406. Thioredoxin has been reported to exhibit certain chemoattractant activities. See Bertini, et al. (1999) J. Expt'l Med. 189:1783-1789.

TaqMan primers were designed for all four novel claudin transcripts. These primer sets were used to screen a panel of human libraries representing different cell types, tissues, and disease states, and two extended cDNA panels. The cDNA panels were composed of samples derived from either normal or diseased human lung or intestine. The claudin genes are some of the most highly regulated genes detected. Moreover, claudin D8 shows the greatest reciprocal regulation between Crohn's and Ulcerative colitis samples, making it a good candidate in future diagnostic panels for these diseases.

claudin-D2: In library southerns, expression is highest in one Crohn's colon, the fetal intestine, and two epithelial cell lines, lower level expression in fetal lung, kidney, ovary and testes. In human cDNA panels, this is highly up-regulated in 8/9 Crohn's disease, both with and without steroid treatment (mean induction=53×, n=9). In addition, claudin-D2 is also induced in 9/12 ulcerative colitis samples (mean induction=8.2×), but this induction is significantly less than that observed in the Crohn's disease samples. Also up-regulated (mean induction=29×) in 12/13 interstitial lung disease samples (idiopathic pulmonary fibrosis, hypersensitive pneumonitis, and eosinophilic granuloma).

claudin-D8: In library southerns, expression is highest in fetal kidney and normal colon. Also, expressed in ulcerative colitis colon, thyroid, and fetal lung. No expression is observed in the cells on the panel. In human cDNA panels, high level expression in the gut. Little to no expression in all Crohn's disease samples mean reduction 130×, n=9). Some ulcerative colitis samples also have reduced claudin-D8 expression, but the pattern is heterogeneous. In contrast, claudin-D8 is up-regulated in several interstitial lung disease samples (12/15, mean induction=9×), but the level of expression in these samples is on the order of ten fold lower than in normal colon. It is also induced in primary human bronchial epithelial cells by I-309.

claudin-D17: In library southerns, overall the expression level measured is low relative to the other claudins described here, on the order of 100 fold lower. It is unclear whether the expression level is actually lower or whether the primers for this gene are insensitive (non-optimal). Expression is highest in one of the asthma lungs and in psoriatic skin. No expression is observed in the cell lines on the panel. In human cDNA panels, the expression is increased in 8/11 ulcerative colitis samples (mean induction=13×), while the expression is unchanged in Crohn's disease samples. Expressed at low level in primary bronchial epithelial cell lines, induced by 1-309. Otherwise, level is too low to detect except in sporadic samples.

claudin-D7.2: In library southerns, expressed at highest level in human fetal and adult lung, monkey lungs, and in one Crohn's colon sample. Lower level expression in the two epithelial (A549 and CHA) and one fibroblast (MRC5) cell lines on the panel. In human cDNA panels, expressed at a high level in the gut and an even higher level in the lung. Up-regulated in Crohn's disease samples from patients which have not been treated with steroids (mean induction=3.7×, n=4). No consistent modulation of this gene in any of the lung diseases examined on this panel.

Claudin family structure: If the genomic structural organization of Claudin family members is based upon that of Paracellin-1, then the proteins would all be encoded by 5 exons. The putative splice sites and exon numbers are predictable, corresponding to the residues of D2 about: 2 codons upstream from M1; A43, A75, G129, and C182; and transmembrane segments corresponding to about G17-V36, M83-C104, V117-H141, and L164-Q188. Paracellin has an extra 60 amino acids at its N-terminus, which is located on the cytoplasmic side of the membrane.

Disease Associations: Claudin-D2 is up-regulated in 8/9 Crohn's disease relative to the control samples, while claudin-D8 is down-regulated. All claudins, described in this invention disclosure, show disease association as described above.

The claudins may form part of a diagnostic panel of genes that could distinguish Crohn's disease from ulcerative colitis, or assist in the determination of disease severity in either or both diseases. For example, claudin-D2 is expressed at higher levels in Crohn's disease than in ulcerative colitis. In contrast, the claudin-D8, cluster 1645577, is expressed at very low levels in Crohn's disease samples, and is less dramatically reduced in most ulcerative colitis samples. See, e.g., Simon, et al. (1999) Science 285:103-106; Hirano, et al. (19xx) Genome Research 10:659-663; Morita, et al. (1999) Proc. Nat'l Acad. Sci. USA 96:511-516; Anderson and Van Itallie (1999) Current Biology 9:R922-R924; and Furuse, et al. (1999) J. Cell Biol. 147:891-903.

Introduction of an adenovirus or another expression vector expressing the claudin-D8 ortholog into the intestines of patients with inflammatory bowel disease may improve intestinal barrier function and ameliorate disease.

In contrast, antibodies to one of the claudins described here may be able to: induce an intracellular signal that could promote tight junction formation and lead to improved intestinal barrier function; block entry of pathogenic agents, which may play a causative role in initiation or maintenance of either Crohn's disease or ulcerative colitis; promote migration of myeloid cells across tight junctions and allow clearance of pathogenic agents prior to infection of the epithelium.

Expression of schlafen family members in fibroblasts/thymoma cells retards or arrests cell growth. They guide cell growth and T-cell development, and are an integral component of the machinery that maintains T-cell quiescence. They may have important roles in the development or maintenance of autoimmune disorders. The mouse schlafens participate in the regulation of the cell cycle. This family is characterized by two splice variants: a short and a long form.

Schlafen B: 748 aa; ORF. Quantitative PCR analysis reveals in T cells, resting DC, M1 macrophage cell panel. Induced in Hashimoto's thyroiditis, fetal kidney, fetal uterus, and fetal spleen. Slightly induced in Crohn's colon.

Schlafen C: 891 aa, full ORF. Quantitative PCR data revealed this to be significantly up-regulated in all Crohn's samples, asthmatic lung, Ascaris lung, Hashimoto's thyroiditis, and fetal tissues compared to control.

Schlafen D: 578 aa, full ORF. The quantitative PCR data for human schlafen D revealed that it is significantly differentially regulated in Crohn's disease and Ulcerative Colitis compared to normal colon. Also it appears to be highly expressed in many developing tissues (fetal) and disease states (allergic, Ascaris and pneumocystis carni lungs, Crohn's colon, ulcerative colitis, and Psoriasis skin) compared to cell lines.

Schlafen E: 897 aa, full ORF. Quantitative PCR analysis reveals expression in the colon, fetal liver, fetal lung, fetal ovary, and fetal uterus, and significantly upregulated in one Crohn's sample and highly induced in Hashimoto's thyroiditis.

Schlafen F: 358 aa; full ORF. Distribution analysis is not complete.

Similar samples may isolated in other species for evaluation.

V. Cloning of Species Counterparts

Various strategies are used to obtain species counterparts of, e.g., the DIRS4, preferably from other primates or rodents. One method is by cross hybridization using closely related species DNA probes. It may be useful to go into evolutionarily similar species as intermediate steps. Another method is by using specific PCR primers based on the identification of blocks of similarity or difference between genes, e.g., areas of highly conserved or nonconserved polypeptide or nucleotide sequence.

VI. Production of Mammalian Protein

An appropriate, e.g., GST, fusion construct is engineered for expression, e.g., in E. coli. For example, a mouse IGIF pGe_(x) plasmid is constructed and transformed into E. coli. Freshly transformed cells are grown, e.g., in LB medium containing 50 μg/ml ampicillin and induced with IPTG (Sigma, St. Louis, Mo.). After overnight induction, the bacteria are harvested and the pellets containing, e.g., the DIRS4 protein, are isolated. The pellets are homogenized, e.g., in TE buffer (50 mM Tris-base pH 8.0, 10 mM EDTA and 2 mM pefabloc) in 2 liters. This material is passed through a microfluidizer (Microfluidics, Newton, Mass.) three times. The fluidized supernatant is spun down on a Sorvall GS-3 rotor for 1 h at 13,000 rpm. The resulting supernatant containing the cytokine receptor protein is filtered and passed over a glutathione-SEPHAROSE column equilibrated in 50 mM Tris-base pH 8.0. The fractions containing the DIRS4-GST fusion protein are pooled and cleaved, e.g., with thrombin (Enzyme Research Laboratories, Inc., South Bend, Ind.). The cleaved pool is then passed over a Q-SEPHAROSE column equilibrated in 50 mM Tris-base. Fractions containing DIRS4 are pooled and diluted in cold distilled H₂O, to lower the conductivity, and passed back over a fresh Q-Sepharose column, alone or in succession with an immunoaffinity antibody column. Fractions containing the DIRS4 protein are pooled, aliquoted, and stored in the −70° C. freezer.

Comparison of the CD spectrum with cytokine receptor protein may suggest that the protein is correctly folded. See Hazuda, et al. (1969) J. Biol. Chem. 264:1689-1693.

For other genes, e.g., membrane proteins, the protein may be best expressed on cell surfaces. Those may be in prokaryote expression systems, or eukaryotes. Surface expressed forms will most likely have conformations consistent with the natural interaction with lipid.

VII. Determining Physiological Forms of Receptors

The cellular forms of receptors for ligands can be tested with the various ligands and receptor subunits provided, e.g., IL-10 related sequences. In particular, multiple cytokine receptor like ligands have been identified, see, e.g., U.S. Ser. No. 60/027,368, 08/934,959, and 08/842,659, which are incorporated herein by reference.

Cotransformation of the DIRS4 with putative other receptor subunits may be performed. Such cells may be used to screen putative cytokine ligands, such as the AK155, for signaling. A cell proliferation assay may be used.

In addition, it has been known that many cytokine receptors function as heterodimers, e.g., a soluble alpha subunit, and transmembrane beta subunit. Subunit combinations can be tested now with the provided reagents. In particular, appropriate constructs can be made for transformation or transfection of subunits into cells. Combinatorial transfections of transformations can make cells expressing defined subunits, which can be tested for response to the predicted ligands. Appropriate cell types can be used, e.g., 293 T cells, with, e.g., an NFκb reporter construct.

Biological assays for receptors will generally be directed to the ligand binding feature of the protein or to the kinase/phosphatase activity of the receptor. The activity will typically be reversible, as are many other enzyme reactions, and may mediate phosphatase or phosphorylase activities, which activities are easily measured by standard procedures. See, e.g., Hardie, et al. (eds. 1995) The Protein Kinase FactBook vols. I and II, Academic Press, San Diego, Calif.; Hanks, et al. (1991) Meth. Enzymol. 200:38-62; Hunter, et al. (1992) Cell 70:375-388; Lewin (1990) Cell 61:743-752; Pines, et al. (1991) Cold Spring Harbor Symp. Quant. Biol. 56:449-463; and Parker, et al. (1993) Nature 363:736-738.

The family of cytokines contains molecules which are important mediators of hematopoiesis or inflammatory disease. See, e.g., Nelson and Martin (eds. 2000) Cytokines in Pulmonary Disease Dekker, N.Y.; Ganser and Hoelzer (eds. 1999) Cytokines in the Treatment of Hematopoietic Failure Dekker, N.Y.: Remick and Friedland (eds. 1997) Cytokines in Health and Disease Dekker, N.Y.; Dinarello (1996) Blood 87:2095-2147; and Thomson (ed. 1994) The Cytokine Handbook Academic Press, San Diego. Ligand and receptors are very important in the signaling process.

VIII. Antibodies Specific for Proteins

Inbred Balb/c mice are immunized intraperitoneally with recombinant forms of the protein, e.g., purified DIRS4 or stable transfected NIH-3T3 cells. Animals are boosted at appropriate time points with protein, with or without additional adjuvant, to further stimulate antibody production. Serum is collected, or hybridomas produced with harvested spleens.

Alternatively, Balb/c mice are immunized with cells transformed with the gene or fragments thereof, either endogenous or exogenous cells, or with isolated membranes enriched for expression of the antigen. Serum is collected at the appropriate time, typically after numerous further administrations. Various gene therapy techniques may be useful, e.g., in producing protein in situ, for generating an immune response. Serum may be immunoselected to prepare substantially purified antibodies of defined specificity and high affinity.

Monoclonal antibodies may be made. For example, splenocytes are fused with an appropriate fusion partner and hybridomas are selected in growth medium by standard procedures. Hybridoma supernatants are screened for the presence of antibodies which bind to the DIRS4, e.g., by ELISA or other assay. Antibodies which specifically recognize specific DIRS4 embodiments may also be selected or prepared.

In another method, synthetic peptides or purified protein are presented to an immune system to generate monoclonal or polyclonal antibodies. See, e.g., Coligan (ed. 1991) Current Protocols in Immunology Wiley/Greene; and Harlow and Lane (1989) Antibodies: A Laboratory Manual Cold Spring Harbor Press. In appropriate situations, the binding reagent is either labeled as described above, e.g., fluorescence or otherwise, or immobilized to a substrate for panning methods. Nucleic acids may also be introduced into cells in an animal to produce the antigen, which serves to elicit an immune response. See, e.g., Wang, et al. (1993) Proc. Nat'l. Acad. Sci. 90:4156-4160; Barry, et al. (1994) BioTechniques 16:616-619; and Xiang, et al. (1995) Immunity 2: 129-135.

Moreover, antibodies which may be useful to determine the combination of the DIRS4 with a functional alpha subunit may be generated. Thus, e.g., epitopes characteristic of a particular functional alpha/beta combination may be identified with appropriate antibodies.

IX. Production of Fusion Proteins

Various fusion constructs are made, e.g., with DIRS4. A portion of the appropriate gene is fused to an epitope tag, e.g., a FLAG tag, or to a two hybrid system construct. See, e.g., Fields and Song (1989) Nature 340:245-246.

The epitope tag may be used in an expression cloning procedure with detection with anti-FLAG antibodies to detect a binding partner, e.g., ligand for the respective cytokine receptor. The two hybrid system may also be used to isolate proteins which specifically bind to DIRS4.

X. Structure Activity Relationship

Information on the criticality of particular residues is determined using standard procedures and analysis. Standard mutagenesis analysis is performed, e.g., by generating many different variants at determined positions, e.g., at the positions identified above, and evaluating biological activities of the variants. This may be performed to the extent of determining positions which modify activity, or to focus on specific positions to determine the residues which can be substituted to either retain, block, or modulate biological activity.

Alternatively, analysis of natural variants can indicate what positions tolerate natural mutations. This may result from populational analysis of variation among individuals, or across strains or species. Samples from selected individuals are analyzed, e.g., by PCR analysis and sequencing. This allows evaluation of population polymorphisms.

XI. Isolation of a Ligand for Receptor

A cytokine receptor can be used as a specific binding reagent to identify its binding partner, by taking advantage of its specificity of binding, much like an antibody would be used. Typically, the binding receptor is a heterodimer of receptor subunits. A binding reagent is either labeled as described above, e.g., fluorescence or otherwise, or immobilized to a substrate for panning methods.

The binding composition is used to screen an expression library made from a cell line which expresses a binding partner, i.e., ligand, preferably membrane associated. Standard staining techniques are used to detect or sort surface expressed ligand, or surface expressing transformed cells are screened by panning. Screening of intracellular expression is performed by various staining or immunofluorescence procedures. See also McMahan, et al. (1991) EMBO J. 10:2821-2832.

For example, on day 0, precoat 2-chamber permanox slides with 1 ml per chamber of fibronectin, 10 ng/ml in PBS, for 30 min at room temperature. Rinse once with PBS. Then plate COS cells at 2−3×10⁵ cells per chamber in 1.5 ml of growth media. Incubate overnight at 37° C.

On day 1 for each sample, prepare 0.5 ml of a solution of 66 μg/ml DEAE-dextran, 66 μM chloroquine, and 4 μg DNA in serum free DME. For each set, a positive control is prepared, e.g., of DIRS4-FLAG cDNA at 1 and 1/200 dilution, and a negative mock. Rinse cells with serum free DME. Add the DNA solution and incubate 5 hr at 37° C. Remove the medium and add 0.5 ml 10% DMSO in DME for 2.5 min. Remove and wash once with DME. Add 1.5 ml growth medium and incubate overnight.

On day 2, change the medium. On days 3 or 4, the cells are fixed and stained. Rinse the cells twice with Hank's Buffered Saline Solution (HBSS) and fix in 4% paraformaldehyde (PFA)/glucose for 5 min. Wash 3× with HBSS. The slides may be stored at −80° C. after all liquid is removed. For each chamber, 0.5 ml incubations are performed as follows. Add HBSS/saponin (0.1%) with 32 μl/ml of 1 M NaN₃ for 20 min. Cells are then washed with HBSS/saponin 1×. Add appropriate DIRS4 or DIRS4/antibody complex to cells and incubate for 30 min. Wash cells twice with HBSS/saponin. If appropriate, add first antibody for 30 min. Add second antibody, e.g., Vector anti-mouse antibody, at 1/200 dilution, and incubate for 30 min. Prepare ELISA solution, e.g., Vector Elite ABC horseradish peroxidase solution, and preincubate for 30 min. Use, e.g., 1 drop of solution A (avidin) and 1 drop solution B (biotin) per 2.5 ml HBSS/saponin. Wash cells twice with HBSS/saponin. Add ABC HRP solution and incubate for 30 min. Wash cells twice with HBSS, second wash for 2 min, which closes cells. Then add Vector diaminobenzoic acid (DAB) for 5 to 10 min. Use 2 drops of buffer plus 4 drops DAB plus 2 drops of H₂O₂ per 5 ml of glass distilled water. Carefully remove chamber and rinse slide in water. Air dry for a few minutes, then add 1 drop of Crystal Mount and a cover slip. Bake for 5 min at 85-90° C.

Evaluate positive staining of pools and progressively subclone to isolation of single genes responsible for the binding.

Alternatively, receptor reagents are used to affinity purify or sort out cells expressing a putative ligand. See, e.g., Sambrook, et al. or Ausubel, et al.

Another strategy is to screen for a membrane bound receptor by panning. The receptor cDNA is constructed as described above. The ligand can be immobilized and used to immobilize expressing cells. Immobilization may be achieved by use of appropriate antibodies which recognize, e.g., a FLAG sequence of a DIRS4 fusion construct, or by use of antibodies raised against the first antibodies. Recursive cycles of selection and amplification lead to enrichment of appropriate clones and eventual isolation of receptor expressing clones.

Phage expression libraries can be screened by mammalian DIRS4. Appropriate label techniques, e.g., anti-FLAG antibodies, will allow specific labeling of appropriate clones.

All citations herein are incorporated herein by reference to the same extent as if each individual publication or patent application was specifically and individually indicated to be incorporated by reference.

Many modifications and variations of this invention can be made without departing from its spirit and scope, as will be apparent to those skilled in the art. The specific embodiments described herein are offered by way of example only, and the invention is to be limited by the terms of the appended claims, along with the full scope of equivalents to which such claims are entitled; and the invention is not to be limited by the specific embodiments that have been presented herein by way of example. 

What is claimed is:
 1. An isolated nucleic acid encoding a polypeptide comprising the sequence of SEQ ID NO:
 9. 2. The nucleic acid of claim 1, wherein the nucleic acid has a sequence of SEQ ID NO:
 8. 3. An expression vector comprising the nucleic acid sequence of claim
 1. 4. A host cell comprising the expression vector of claim
 3. 5. The host cell of claim 4, where in the host cell is selected from the group consisting of a bacterial cell, a mammalian cell, or a yeast cell.
 6. The host cell of claim 5, wherein the mammalian cell is a CHO cell.
 7. A method of expressing the polypeptide of SEQ ID NO: 9, comprising: a) culturing the host cell of claim 4 in a medium suitable for expression of the polypeptide; and b) isolating the polypeptide. 